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Welcome to the second issue of Continuum. Our previous issue, dated Winter 2016, is available 
through the St. Clair County Community College Library. We’re proud to bring you the best original work in 
mathematics, both pure and applied, from the faculty (both full and part time) at St. Clair County Community 
College. We’re very excited for this edition to also feature the work of one of our outstanding students, 
Rebekah Muzzi. 

We hope to bring out a new edition on an annual basis. Last time around, we were pleased to be able to 
say that all seven of our full-time mathematics professors contributed. This time, our focus was including 
contributors from outside our discipline. The results: two applied mathematics articles, one from Dr. Janice 
Fritz in our biology discipline, and another from Dr. James Mattiello, an adjunct professor who passed away 
just after completing this remarkable article. We are particularly proud to present this work by a truly brilliant 
physicist. 

The articles here are intended mostly for mathematics enthusiasts but also offer teaching ideas, and the 
background knowledge required varies. Even if you know but little mathematics, the article about the Online 
Encyclopedia of Integer Sequences should be approachable. Rebekah’s article, on the other hand, is highly 
recommended for readers with a fair amount of calculus knowledge. We’ve also included a couple of brief 
mathematician biographies. In future issues we would love to include responses to any inquiries, or articles 
tailored to reader requests. One idea we may incorporate is an “Ask a Mathematician” column, in which we 
answer your questions. 

If you would like to comment, ask us anything, or offer any input whatsoever, we would love to hear 
from you. Please email or write to the addresses below. If you wish your comments forwarded to any of our 
authors, we will do that. Please include explicit permission to publish your letters, if you wish to grant it, and 
we may print some letters in the next edition. 

Thank you for your interest, and we hope you find something to stimulate your imagination and make 
you think. 

Paul Bedard: pbedard @ sc4.edu 
Nick Goins: ngoins@sc4.edu 

Or write us care of: 

St Clair County Community College 
323 Erie Street 
PO Box 5015 

Port Huron, MI 48061-5015 
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Fun with The Online Encyclopedia of Integer Sequences (OEIS) 


Paul Bedard 

Introduction 


The first mathematics a child learns is an integer sequence. 

1,2,3,4, 5, — 

It’s just counting, or “saying your numbers.” Later, students learn their “times tables” - merely more sequences 


4,8,12,16, 20, ■ 


This is the list of the multiples of 4. An old Persian folktale tells of a foolish king who offered a wise man any 
reward he could name. The wise man asked for a chess board with one grain of wheat on the first square, two 
on the second, four on the third, and so on - each square doubling the number of grains. The king agreed, not 
knowing how quickly exponential functions grow. The sequence 

1,2,4, 8,16,- 

grows to 2 63 or 9,223,372,036,854,775,808 by the 64 th entry (the last square on the chess board.) 

There is a fascination in lists, and lists with a pattern are more intriguing still. In elementary school, my 
friend Bill Mullan and I used to challenge each other daily. One of us would call out the beginning of a 
sequence across the playground, and the other, wherever he was, would have to continue it if he could. Neither 
of us ever stumped the other. There are so many possibilities! One of my favorites is the “pancake sequence.” 

Question. What is the largest number of pieces a pancake can be cut into with one straight cut? 

Of course, there is no way to get anything other than two pieces. What about two cuts? Draw a circle and then 
draw straight lines across the circle. You will see that you can get 4 pieces with two cuts, if the cuts intersect 
each other. What about three cuts? If you cut the pancake like a pizza, you will get 6 pieces. But, if the first 
two cuts intersect each other and the third cut does NOT pass through the point of intersection of the first two - 
voila! Seven pieces! So, 


2,4,7, 


is the beginning of the pancake sequence. 
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The First Four Terms Of The Pancake Sequence 





Exercise. Find the next term in the pancake sequence. 

Here is a very challenging sequence that will require some “outside the box” thinking! 

3, 3, 5,4, 4,3,5, 5,4, 3, 6, ••• 

I wish I had thought of this one for Bill Mullan! But it isn’t really fair. You are probably looking for a 
mathematical pattern. I have simply listed the number of letters in the English words 

one, two, three, four, five, six, seven, eight, nine, ten, eleven, ■ ■ ■ 

By now you must be wondering about the “...” I keep using. These three dots are called an ellipsis, and 
they represent the fact that the sequence continues, following the same pattern, forever. The ellipsis can 
sometimes be misleading, so it should never be used until a clear pattern has been established. For instance, if I 
write 2,4,6,..., then the reader spots quickly that I am listing even integers, and assumes that the sequence 
continues .. .,8, 10,12, ... However, I may have intended some other pattern that just coincidentally starts with 
2, 4, 6. 

Perhaps you are curious about the fourth entry in the pancake sequence. It’s time I introduce the real 
star of this story - the OEIS, the Online Encyclopedia of Integer Sequences. 

https://oeis.org/ 

With this wonderful tool, we can explore all sorts of exciting sequences. What’s more, the Encyclopedia 
explains how the numbers are found, and also mentions other ways the same list can be created. For instance, 

1 , 3 , 6 , 10 , 15 ,- 
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is a sequence called the “triangular numbers.” Think of an array of bowling pins: 


oooo 
o o o 
o o 
o 

There are ten pins. But what if you made an array with only three rows of pins? 


ooo 
o o 
o 


But the triangular numbers are also: 

o The number of legal ways to insert a pair of parentheses in a string of n letters, 
o The number of tiles in a set of double - n dominoes, 
o The number of ways a chain of n non-identical links can be broken up. 

o The maximum number of intersections of n+1 lines which may only have 2 lines per intersection 
point. 

o Number of ways two different numbers can be selected from the set {0,l,2,...,n} without 

repetition, or, number of ways two different numbers can be selected from the set {l,2,...,n} with 
repetition. 

If any of the above doesn’t make sense, look it up! (The above bulleted points are not my words: they are a 
direct quote from the OEIS.) 

The OEIS was started by Neil Sloane in 1964. “The sequence database was begun by Neil J. A. Sloane 
in early 1964 when he was a graduate student at Cornell University in Ithaca, NY.” (OEIS). Sloane began to 
curate interesting sequences by writing them on file cards. 2,372 of these sequences were recorded in a book, 

A Handbook of Integer Sequences 1 ’, Academic Press, NY in 1973. Now, anyone who discovers a sequence can 
propose it for addition to the database. (It is not, however, easy to come up with one nowadays that no one else 
has thought of! More on this later.) As of this writing, the OEIS contains Read more about the history at 
https ://oeis. or g/wiki/W elcome . 

Exercise 1: Derive the next 5 terms in the sequence of triangular numbers. 

Exercise 2: Derive the 4 th term in the pancake sequence. 

Exercise 3: Find each of the sequences given above in the OEIS and state their catalog reference. 


Some Basic Sequence Facts 


Before we start exploring, I will introduce some notation and terminology. A sequence is a lsit of 
numbers, separated by commas. A particular number in a sequence is called an entry of the sequence. An 
integer is a number which is either a counting number (1,2, 3,4, ...), or the opposite (negative) of a counting 
number, or zero. An integer sequence is a sequence all the entries of which are integers. 
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A general entry in a sequence (that is, an entry we want to discuss without necessarily knowing its 
value) is usually referred to by the lower case letter a, with a subscript denoting the placement within the 
sequence. So, a 5 is the fifth entry in a sequence. 

The expression {a n } represents an entire sequence. 

A formula for a sequence can be given by referring to the subscript n as though it were a variable For example, 
the sequence of odd positive integers 


1, 3,5,7,9,11,- 

can be generated using the formula a n — 2n— 1. To find any entry, say a 5 , we simply evaluate the formula for 
that value of n. 


a 5 = 2(5) —1 = 9 


There is another way a formula for a sequence can be given: a recursive formula shows how to obtain an entry 
in a sequence by using previous entries. The most famous recursive formula is the one that defines the 

Fibonacci sequence: 


®n-1 3” ®n-2 

In ordinary words, this formula says that each entry in the Fibonacci sequence is the sum of the two entries that 
come right before it. But this implies that we have to be told what the very first two entries are, since there is no 
way to use the recursive formula to obtain them The first two entries are both l’s. 


Why Just Integers? 


Many important and fascinating sequences have noninteger entries. For instance, the harmonic 
sequence 

111111 
r 2' 3' 4' 5’ 6 ’"' 


Such sequences are well worth our consideration! However, there are two important reasons to look at integer 
sequences separately. Firstly, it will be far easier to get a hold of some of the concepts we need if we restrict 
ourselves to just the integers. Secondly, many important facts about noninteger sequences can be discovered by 
looking at related integer sequences. Let’s make this our first exploration. 

Let’s take a sequence that is decidedly nonintegral: the sequence of square roots of positive integers. 
This sequence is defined by the formula a n — yfn. The first few terms are 

Vl,V2,V3,V4,V5,- 

Some of these entries are in fact integers. The noninteger entries are all irrational numbers (this is moderately 
easy to prove!) An irrational number has a decimal representation that does not terminate or repeat. In other 
words, we cannot write out all the digits of an irrational number. 
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Let’s write out the first four digits of the numbers in our sequence: 


1.000,1.414,1.732,2.000, 2.236, - 


How can we make this an interesting sequence? Before I explain this, please do the following: using a 
computer or calculator, list the first 20 square roots, with only three decimal places after the decimal point for 
each. I will leave some space for your sequence. 

Okay, now we will turn this into an integer sequence by listing only the first digit of each square root. In other 
words, we will take the “integer part” of each of these numbers. 


1, 1, 1, 2, 2, 2, 2, 2, 3, 3, 3, 3, 3, 3, 3,3, 4, 4, 4, 4, 


This is sequence A000196 in the OEIS. What useful information can this list provide us with? Looking it up, 
we see that it also tells us the number of perfect square less than or equal to n. In other words, since a 8 = 2, 
there are two perfect squares less than 8 (they are 1 and 4, of course.) 

Let’s prove this. 

Theorem 1: The integer part of the square root of a positive integer is equal to the number of perfect square 
less than or equal to that integer. 

Proof. Let n be a positive integer. Suppose there are k perfect squares less than or equal to n. Then these 
perfect squares are 


1 2 ,2 2 ,3 2 ,-,k 2 

If k 2 = n then k — Vn and the integer part of y/n is k. This would satisfy the theorem. So let’s assume k 2 < 
n. 

Let us say that the integer part of 4n is m 
Suppose m> k. 

Then m 2 > k 2 but we know that m 2 < n. Since k 2 < n we know that k 2 < m 2 < n. Then m 2 would be a 
perfect square between k 2 and n, which is a contradiction. So m < k. 

Suppose m < k. Then m 2 < k 2 . Since m is the integer part of Vn , ( m + l) 2 > n. Since there are no perfect 
squares between m 2 and (m + l) 2 and m 2 < n < (m + l) 2 , then there are k perfect squares less than (m + 
l) 2 . But m 2 is one of these, and the largest of these is k 2 , so m 2 < k 2 . This is a contradiction. 

Therefore m = k. 

Let’s go back to the harmonic sequence for a moment. What sequence of integers naturally arises from this? 
Consider the partial sums of a n = 1/n. 
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1 1 1 _ 11 
1 + 2 + 3 _ ~6 
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1 1 1 1 _ 50 _ 25 

T + 2 + 3 + 4 ~~ 24 ~~ 12 

1 1 1 1 1 _ 137 

1 + 2 + 3 + 4 + 5 _ ~60~ 

Let’s consider the sequence of just the numerators of these fractions: 

1,3,11,25,137,- 

This turns out to be sequence A001008 in the OEIS. We learn that for prime numbers p greater than 3, p 2 
evenly divides a(p — 1) for any a in this sequence! We won’t prove that here. But it is interesting that a 
number theoretical fact about primes emerged from our investigation into this sequence. 

Exercise 4: How many perfect squares are less than 11,208? 

Finding More Sequences 

I enjoy typing random strings of integers into the OEIs and seeing what comes up. It’s truly amazing. 
Let’s type some integers at random, with duplications: 

1,1,1,2, 2, 3,4,4, 5 

The results? 

• The initial digits of perfect cubes (starting with 10 3 ); 

• Number of partitions of n into distinct parts, none being 7; 

• Number of partitions of n into 8 primes; 

• Number of partitions of n into 9 primes; 

• Number of partitions of n into 8 primes; 

• Expansion of 

1 

(1 — x)(l — x 2 )(l — X 5 ) 

• Number of 7’s in all the partitions of n into distinct parts. 

This exploration has introduced us to the idea of partitions! Let’s explore that a little further. 

Definition. A partition of a positive integer is a presentation of the integer as a sum of integers. 
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Examples. 


1 

2 = 1 + 1 

3=2+l=l+l+l 

4=3+l=2+2=2+l+l=l+l+l+l 

Including the integer itself, we have seen that 1 has one partition, 2, has two, 3 has three, and 4 has five. Let’s 
add one more before we check OEIS. 

5=4+l=3+2=3+l+l=2+2+l=2+l+l+l+l=l+l+l+l+l 
So, our sequence representing “the number of partitions of n” is 

1, 2, 3, 5, 7, ••• 


Let’s enter it! It’s A000041. 

Now that we have learned a little about partitions, we can look at some of the other results. 

Number of partitions of n into distinct parts, none being 7. A015754 

Distinct parts means we cannot re-use integers. Going back to our examples above, 

1 

2 = +4+4 

3 = 2 + 1 = 1 + 1 + 1 

4 = 3 + 1= 2-4—2 = 2 + 1 + 1 = 1 + 1 + 1 + 1 

5 = 4 + l = 3 + 2 = 3 + 1 + 1 = 2 + 2 + 1 = 2 + 1 + 1 + 1 + 1 = l + l + l + l + l 
Adding 6 = 5 + l = 4 + 2 = 3 + 2 + l 

The sequence of partitions of n into distinct parts is therefore 

1, 1,2, 2, 3,4,- 

Checking the OEIS for confirmation, this is indeed A000009; wow! Only the ninth sequence on the list! (Note 
that our sequence starts with the seventh Padovan number.) 

While I was checking for this, I encountered the Padovan sequence, A000931 - this is brand new to me! What 
can we learn here? 

The OEIS defines the Padovan numbers recursively -just like the Fibonacci numbers! The recursive formula is 

a(n) = a(n — 2) + a(n — 3), a(0) = 1, a( 1) = a( 2) = 0 
Exercise: Write out the first 15 Padovan numbers. Check your work in the OEIS! 
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Let’s look at just one of many uses of this sequence, quoted directly from the OEIS: 

a (n) counts closed walks from a vertex of a unidirectional triangle 
containing an opposing directed edge (arc) between the second and third 
vertices. Equivalently the (1,1) entry of A A n where the adjacency matrix 
of digraph is A=(0,1,0;0,0,1;1,1,0) . - David Neil McGrath , Dec 19 2014 


What does this mean? Well, here we have entered the exciting world of graph theory! Graph theory 
deals with the study of objects called graphs, which are collections of points (or vertices, or nodes) and lines 
(or edges). Draw a few points. Connect some of them (or all of them) with edges. You have created a graph! 
Graphs are also called networks, and they can model the flow of traffic, information, or just about anything, 
from point to point. Two of the most famous problems addressed by graph theory are the Four Color Problem 
and the Konigsberg Bridge Problem. (I suggest you look these up!) A directed graph or digraph is a graph 
where each edge has an associated direction. A directed edge can be called an arc. Directed graphs can be used 
to model the spread of ideas, diseases, or rumors from person to person or machine to machine in a network. 

Let’s try to figure out David McGrath’s idea. 


A 



We now need some more definitions. A walk is a sequence of vertices that we can reach along edges. 
So ABC would be a walk and so would ABCAB. A closed walk is a walk that starts and ends at the same 
vertex. 

Starting with A, we could return to A by the closed walk ABCA. This walk includes three edges, edge 
AB, edge BC, and edge AC. Therefore it has a length of 3. 

Is there any other way to do it? 

There are no closed walks of length 4. 
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Can you find any other walks of lengths 3 or 4? 

There are no closed walks of length 4. 

What about length 5? 

ABCBCA is a closed walk of length 5. 

Let’s put this together in a chart. 

n closed walks of length n 
0 1* 

1 0 

2 0 

3 1 

4 0 

5 1 

6 1 

7 1 

8 2 

*(This means there is one way to get from A to A with a walk of length 0 - just stay at A!) 

Exercise. What are the two closed walks of length 8? 

Answer: ABCABCBCA and ABCBCABCA. 

The adjacency matrix of a digraph is a matrix with an entry 1 if a path exists from the vertex labelling the row 
of the matrix to the vertex labelling the column, and entry 0 if no such path exists. The adjacency matrix for 
this digraph is 

0 1 O' 

0 0 1 
.1 1 0. 

The 1 in the middle of the first row indicates that 1 walk of length one exists from vertex A to vertex B. The 
zero in the top right indicates that no walk of length one exists to get you from A to C. 

For advanced readers: using matrix multiplication, the powers of the adjacency matrix give the numbers of 
walks of longer lengths. Thus, squaring the matrix gives the number of walks of length two, cubing gives 
length three walks, etc. Does McGrath’s last statement now make sense?) 

To convince the skeptical reader that I am truly using random strings of numbers and not carefully selecting 
interesting sequences, for my next example I will use the numbers of letters in the first five words of this 
sentence. 

2, 8,3,9, 6, ••• 

It comes up! It is, among other things, “The spiral of Champemowne read by the South-Southwest ray.” I have 
never before heard of this. It is also part of the decimal expansion of Soldner’s constant, and part of the decimal 
expansion of 
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6 + 2VT0 

3 


Let’s try again. Let’s take the counting numbers and reverse their order, in pairs. 

2,1 4, 3,6, 5, ••• 

Well, OEIS comes up with our pattern, but scrolling down, I find something interesting. 

A096779: Smallest number not occurring earlier having no common digits with n in its decimal representation. 

Let’s see what this means. The first entry (n = 1) of our sequence is 2. 2 is the smallest number which has no 
1 ’s as digits. The second entry (n = 2) is 1. One is the smallest number with no 2’s as digits. 

Question: Will this sequence continue to swap natural numbers in the pairwise fashion that we started with? 
Answer: No! 

Exercise: Find the first 15 entries in this sequence. 

Challenge exercise: This sequence is finite! It has a last entry. Why? And what is this last entry? See OEIS 
for answers. 


Using Sequences to Solve Problems 

As we have seen, integer sequences can help us solve many kinds of problems. Let’s see if we can use 
the OEIS to solve some more. 

Problem. How many ways are there to seat 10 people in a circle (say, at a table?) 

Circular arrangements are trickier than linear ones. Let’s take the advice of George Polya. In his 
famous book How To Solve It , Polya suggests the strategy of solving a simpler, related problem first. 

So let’s start with one, two, or three people. One person can be seated in one way. Two people can also 
be seated in one way - and this differs from lining people up. With a linear arrangement, AB is different from 
BA. But arranged around a round table, AB is indistinguishable from BA. Two arrangements around a table 
can be considered different if, for any one person in one arrangement, the people to her right and left are 
different than in the other arrangement. 

What about three people? Now there are two arrangements. One is ABC, or the equivalent BCA or 
CAB, and the other is ACB or its equivalents CBA or BAC. 

Perhaps you have spotted a pattern. If we move any person from the end of the list to the beginning, it is 
an equivalent circular arrangement. So, for four people, 

ABCD =BCDA = CDAB = DABC 

But if we start with AC we get a different arrangement: 

ACBD = CBDA = BDAC =DACB 
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We could also start with AD. What if we start with B? We already have BCDA and BDAC. What 
about BA? 


BACD =ACDB =CDBA =DBAC 

We also need to count arrangements starting with ADC, ADB, and BCA. 

WE find that there are 6 possible arrangements. (Convincing yourself of this may take a little work!) 

So our sequence of answers for 1,2,3, or 4 people is 

1 , 1 , 2 , 6 ,- 

Entering this into the OEIS gives us sequence A000142 - the factorial numbers. Circular arrangements 
represent one of many descriptions. Note that the number of circular arrangements of n objects is (n — 1)! 

Now the answer to our problem is available without our needing to perform any calculations. The 
number of arrangements of ten people at a round table is 362,880. 

While we are here, let’s note that 


1 , 1 , 2 , 6 ,- 

is also the beginning of an entirely different sequence. A003418 lists the least common multiple of the set of 
numbers (1, 2, 3, ••• ,n}. In other words, for n — 4 our sequence entry is 6. 6 is the least common multiple of 1, 
2, 3, 4. 

Exercise. Find the first 15 terms of A003418. 

But these four numbers 1, 1, 2, 6 are also part of another sequence, the so-called primorial numbers. 
The nth primordial number is the product of all primes less than or equal to n. For this to make sense, we need 
to include 1 as a prime - it isn’t ordinarily considered to be prime. 

1,1,2,6,30,30,210,- 

Recall that prime numbers are numbers greater than or equal to 2 which have only one and themselves 
as divisors. A composite number has factors other than itself and 1. 

The primes (A000040) are 

2,3,5, 7,11,13,17,19,- 

So for n — 8, the primordial number would be the product of all the primes less than 8, or 


2 ■ 3 ■ 5 ■ 7 = 210 


Exercise: Find the first 15 terms of A003418. 

I have saved the best for last. 1, 1, 2, 6 is also part of sequence A001349 - the number of connected 
graphs with n nodes. We are back to graph theory! 

A connected graph is a graph in which it is possible to move from any vertex to any other vertex along 
existing edges. Imagine the map of the United States with each state represented by a point, or node. Place an 
edge connecting two states if those states share any border - that is, if they are contiguous with each other. This 
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graph will not be a connected graph. Hawaii, for instance, will be an isolated vertex - a vertex with no edges 
joining it to any other vertex. It will not be possible to move from any other state to Hawaii along existing 
edges. 

Let’s draw all the connected graphs with three nodes. 



There are two of them. 

Exercise: Draw all connected graphs with four nodes. 
Answer: 



Exercise: Draw all connected graphs with five nodes. 

Even More Fun 

Here are some challenges. Hello, Bill Mullan! Try to figure them out before looking them up. 
A029579 “An obvious mixture of two sequences” 

1,1, 2, 3, 3, 5,4, 7,5, ••• 
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A002808 


4, 6, 8,9,10,12,14,15,- 


A123865 


0,15,80,255,- 


Conclusion 


We have seen how exploring the Online Encyclopedia of Integer Sequences can be a lot of fun, and also 
how it can lead us to many mathematical explorations. Seeing connections between things, solving problems, 
looking at familiar things in new ways - all of these help to develop our mathematical intuition. Intuition is the 
mental faculty of seeming to understand things or know answers or predict outcomes without apparently having 
all the needed facts. What’s really going on is that your brain is using all kinds of knowledge and experience 
that you have begun to take for granted, so you aren’t entirely aware you are using it. 

Just starting with random sequences, we have explored number theory and graph theory. We’ve drawn 
pictures and made charts. We have proved theorems and discovered some mathematicians and the work they 
do. I have included a glossary of mathematical terms defined in this article at the end. 

I hope you will continue this exploration. There is a fun Wikipedia game: give every player the same 
starting point, like an article on South American beetles, and see how many clicks it takes them to find a 
Wikipedia article on some completely different target subject, like the novel Frankenstein. I would like to 
suggest an OEIS version of the game. Start with any random string of integers - if it isn’t in the OEIS you can 
pick a second one - and by clicking on linked sequences, try to eventually reach the prime numbers, the 
Fibonacci numbers, or some other sequence. 

I can’t resist adding one more. I just typed in 

1, 2, 5,9,17 

and I learn that this is the number of partitions of n if there are two kinds of 1 and two kinds of two! Now I will 
grab a blue marker and a red marker... 

Glossary 

Adjacency Matrix 

Closed walk 

Connected graph 

Edge 

Ellipsis 

Factorial 

Fibonacci number 
Graph 

Harmonic sequence 
Isolated vertex 
Node 

Padovan number 
Pancake Sequence 
Partition 
Prime number 
Primorial Number 
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Recursive 
Triangular number 
Walk 


More Bonus Questions 

1. Draw a digraph based on this adjacency matrix. 

-oioi- 
0 0 10 
0 10 1 
-1 0 0 0 - 

Now find the sequence of closed walks starting at A of length 1, 2, 3, 4, 5, 6, etc. 
(There will be many zeros in this sequence.) Is it in the OEIS? 


2. Can every positive integer be expressed as the sum of one or more Fibonacci numbers? 


3. Suppose we define an odd partition as a partition including only odd integers. Find the first ten terms of 
the sequence of odd partitions of 1,2,3,.. .,10. Is this in the OEIS? What else is it equal to? 

4. How many squares do you see? 


This is called a 3 x 3 chessboard. Find the number of squares in an n x n chessboard for n = 
1, 2, 3,4, 5. Check your sequence in the OEIS. 


5. Starting in the bottom left corner of a 3 x 3 chessboard and making only three kinds of moves - right 
one space, or diagonally up and over one space, or diagonally down and over - count how many ways 
there are to reach the bottom right comer. Do the same for n x n chessboards for n = 1,2, 3,4, 5. This 
is called the Motzkin sequence 
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Allowable Moves For The Motzkin Sequence. 


» / ^ 


6. Read my other article and explore arithmetic derivatives in the OEIS. 


References. 


N. J. A. Sloane, editor, The On-Line Encyclopedia of Integer Sequences , published electronically at 
https://oeis.org . 
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Zeros and Multiplicity 

Jeff VanHamlin 


The path of the curve for the function /(x) about its zeros depends on the multiplicity of each zero. 

Theorem: If /(x) = (x — c) n ■ g(x) where g(x) is a polynomial of degree n > 1 and g(c) =£ 0. 

Then x = c is a zero of multiplicity n and the following holds: 

I. If n = 1, the curve /(x) crosses the x-axis at x = c in a non-horizontal linear path. 

II. If n = 2, the curve /(x) has a local minimum or maximum on the x-axis at x = c. 

III. If n = 3. the curve /(x) has a saddle point on the x-axis at x = c. 

In general; for n > 1 and even, the function has a local minimum or maximum at x = c and for n > 1 and odd, the 
function has a saddle point at x = c. 

Proof: 

Let /(x) = (x — c) n ■ g (x), where g(x) is any polynomial where g(c) ^ 0 and n e M. Therefore /(x) is a polynomial 
which has a zero of x = c with multiplicity n. 

/'(x) = n(x — c) n_1 ■ g{x) + (x — c) n ■ g'(x) 

= (x — c) n-1 [n ■ g(x) + (x — c) ■ $'(x)] 

I. If n = 1, then 

/'(x) = (x - c) 1-1 [l ■ g (x) + (x - c) ■ ,g'(x)] 

= #(x) + (x - c) ■ £'(x) 

Therefore at x = c 


/'(c) = g(c) = non — zero constant 

This means around the zero, the slope is constant, and therefore linear. Moreover, the slope of the linear path around the 
zero is g(c), since the tangent line to the curve at x = c is y = g(c)(x — c). 

II. If n = 2, then 

/'(x) = (x - c) 2-1 [2 ■ g{x~) + (x - c) ■ g' (x)] 

= (x - c)[2 ■ flf(x) + (x - c) ■ flf'(x)] 

Therefore at x = c 


/'(c) = 0 
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This means at x = c the, the slope of the curve is 0. Giving a tangent line of y = 0 at x = c. 

By taking the second derivative, we can check the concavity of the curve. 

/"(*) = 1' [200*0 + (x- c)g'{x )] + (x - c)[2g'ix) + g'(x) + (x - c)g"(x)] 

At x = c, 


/"(c) = 2g(c) = non — zero constant 

Therefore, in the neighborhood around the x = c, /(x) is either concave up or concave down depending on whether g(c) 
is positive or negative. Therefore, the function /(x) has a local minimum or local maximum at x = c. 

111. If n = 3. then 

fix) = (x - c) 3-1 [3 ■ g{x) + (x - c) ■ g' (x) ] 

= (x - c) 2 [3 ■ g{x) + (x - c) ■ ^'(x)] 


Therefore at x = c 


/'(c) = 0 


This means at x = c the, the slope of the curve is 0. Giving a tangent line of y = 0 at x = c. 

By taking the second derivative, we can check concavity of the curve. 

f(x) = 2(x - c) ■ [3 g{x) + (x - c)g'{x )] + (x - c) 2 [3^'(x) + g'{x) + (x - c)g\x )] 


At x = c, 


/"(c) = 0 


To verify that x = c is a saddle point and not a local minimum, it must be verified that fix) has the same sign before and 
after x = c. 

Since 


/'(*) = (x - c) 2 [3 ■ g (x) + (x - c) ■ g'(x)] 

= (x — c) 2 [/i(x)] 

Then x = c is a local minimum of fix), verifying that the function is monotonic and therefore increasing or decreasing 
in the neighborhood of x = c. 

Therefore x = c is a saddle point of fix). 

Next, we will prove the factors of even multiplicity are local minimums or maximums at the zero and factors of odd 
multiplicity greater than 1 have a saddle point at the zero. 
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An induction proof will be used. We will first prove the even multiplicities and then the odd. 

In general 

/(*) = (x-cf- g(x) 

fix ) = n(x — c) n_1 ■ g(x) + (x — c) n ■ g'(x) 

= (x — c) n_1 [n ■ ,g(x) + (x — c),g'(x)] 

/"(x) = n(n — l)(x — c) n ~ 2 ■ g(x) + n(x — c) n ~ 1 g'(x ) + n(x — c) n_1 ■ g'(x) + (x — c) n ■ ,g"(x) 

= (x — c) n_2 [n(n — 1) ■ g(x) + 2n(x — c) ■ g'{x ) + (x — c) 2 ■ ,g"(x)] 

What now needs to be proved is the shape of the curve around the zero for even and odd multiplicities for integer values 
greater than n > 1. 

Let n be an even number greater than 1. 

For n = 2, it was verified that the curve has a local minimum or maximum at x = c. 

As for the induction proof we will assume the function 

y = (x-cf- g(x) 

has a local minimum or maximum at x = c for any even interger n 
Therefore, we must show 

fix) = ix - c) n+2 ■ g(x) 

has a local minimum or maximum at x = c, since (x — c) n+2 is the next zero of even multiplicity. 

The function has a zero at x = c, since 

/(c) = 0 

The function has a horizontal tangent line, because 

fix) = (n + 2)(x — c) n+1 ■ g{x) + (x — c) n+2 ■ g'{x) 

= (x — c) n+1 [(n + 2) ■ g{x) + (x — c) ■ g'(x)] 

And at x = c 

/'(c) = 0 

Finally 

/"(x) = (n + 2)(n + l)(x — c) n ■ g(x) + (n + 2)(x — c) n+1 ■ g'(x) + (n + 2)(x — c) n+1 ■ g'(x) + (x — c) n+2 
■£"(*) 

= (x — c) n [(n + 2 )(n + 1) ■ g(x) + 2(x + 2)(x — c) ■ ^'(x) + (x — c) 2 ■ ^"(x)] 
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At x = c, 


f"{c ) = 0 

Notice that the second derivative has the form 

/"(x) = (x — c) n ■ \h(x )] 

Therefore, from the assumption, the factor x — c has even multiplicity. So /"(x) has a local minimum or maximum at 
x = c and thus is concave up or concave down at x = c, so the first derivative changes sign before and after the zero 
x = c. Therefore /(x) has a local minimum or local maximum at x = c. 

Now to prove the odd multiplicity for n greater than 1. 

Let n be an odd number greater than 1. 

It has been shown that when n = 3, the function has a saddle point at x = c. 

Assume 

y = (x-c) n -^(x) 

To complete the induction proof, it must be shown that the function 

/(x) = (x - c) n+2 ■ g(x) 

has a saddle point at x = c, since (x — c) n+2 is the next zero of odd multiplicity. 

The function has a zero at x = c, since 

/(c) = 0 

The function has a horizontal tangent line, because 

fix) = (n + 2)(x — c) n+1 ■ g(x) + (x — c) n+2 ■ g'(x ) 

= (x — c) n+1 [(n + 2) ■ g(x ) + (x — c) ■ g' (pc)] 

And at x = c 

/'(c) = 0 

Finally 

/"(x) = (n + 2)(n + l)(x — c) n ■ g(x) + (n + 2)(x — c) n+1 ■ g'(x) + (n + 2)(x — c) n+1 ■ g'(x) + (x — c) n+2 
■£"(*) 

= (x — c) n [(n + 2)(n + 1) ■ g{x) + 2(x + 2)(x — c) ■ g'(x) + (x — c) 2 ■ g"(x)] 
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The second derivative has the form 


f"{x) = (x — c) n ■ [/i(x)] 

Since n is an odd number, /"(x) has a saddle point at x = c. Therefore the concavity of f"(x) changes at the zero x = c, 
so /'(x) has the same sign before and after x = c. So the function /(x) is monotonic around the zero x = c, and therefore 
a saddle point. 


25 


Volume 2, Number 1, Continuum 



Finding The Tangent Line to a Rational Function, using Division 

Nick Goins 


In this article, I will present a method for finding a tangent line to the graph of a rational function, using 
polynomial division. As we will see, the method will apply only to a certain sub-collection of rational functions 
and it will require that the point of tangency be on the y-axis. We will then discuss conditions under which we 
can generalize the method. That is, we will consider points of tangency which are not on the y-axis, and we will 
see when we are able to use a horizontal shift so that the corresponding point is on the y-axis. To motivate the 
method, we will begin with an example. 

x^ -hx^ —x+4 

Example 1: Consider the rational function f(x ) = ——q-j—. Using polynomial division to write /(x) as a 
proper rational function, we get 

—2x + 3 

/(x) = x + 1 + 

x z + 1 


Now find the tangent line to the graph of /(x) at x = 0. The point of tangency is (0,/(0)) = (0,4) and the 
derivative of /(x) is 


/'(*) 


(3x 2 + 2x — l)(x 2 + 1) — (x 3 + x 2 — x + 4)2x 
(x 2 + l) 2 


so that the slope of the tangent line is m — /'(0) 


= — 1. The tangent line is 


y — y l —m{x — x 1 ) => y — 4 = — l(x — 0) => y = — x + 4 


Notice that if we add the quotient and remainder found from using polynomial division, we get 


(x -|- 1) + (—2x T 3) — —x + 4 

□ 

That is, the tangent line is the same as the sum of the quotient and the remainder. Is this a very unique case or is 
there an underlying reason for this to happen more generally? 

Note: Before answering the question in general, let’s consider the following function similar to the above 
example, 


h(x) — mx + b + 


cx + d 

ax 2 + (3x + y 


We want to see if this function has the property that the tangent line to the graph of h(x) at x = 0 is the sum of 
mx + b and cx + d. We also want to see if there are any requirements on the coefficients for this to happen. 
The point of tangency for h{x) at x = 0 is 

(o,h(0)) = (o,b + d -) 
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The derivative of h(x) is 


h' (x) — m + 


c(ax 2 + fix + y) — (cx + d)(2ax + /?) 
(ax 2 + fix + y) 2 


and the slope of the tangent line is 


h'( 0) — m + 


cy — dfi 
Y 2 


The equation of the tangent line is 
y — y 1 = m(x — xfi) 



+ 


cy — dfi 
Y 2 



which reduces to 

/cy — dfi\ d 

y — mx + b + --— x + — 

\ y 2 ) y 


The first two terms are the quotient and now we will set the last two terms equal to the remainder to find 
conditions on the coefficients. That is, 

/cy — dfi\ d 

y ——J x + — — cx + d => y — 1, fi — 0 

That is, the result holds if the denominator has the form ax 2 + l.The following theorem provides the general 
answer to the question posed above. 


Theorem 1: For a rational function f(x) — the tangent line to f(x) at x = 0 will be identical to the sum of 
the quotient and remainder if and only if f(x) can be written in the following form 


r(x) 

m = mi x + b 1+ — 

where the remainder, r(x), is linear and d(x) is a polynomial of degree at least two, and has the form d (x) = 
a k x k + a^jX^ -1 + —I- a 2 x 2 + 1. 


Proof: The point of tangency is 


( 0 ,/( 0 )) 



r( 0) \ 

d(0)J 


To find the slope of the tangent line, we need to differentiate f(x ) 


/'(x) — m + 


r'(x)d(x) — r(x)d'(x) 
(d(x)) 2 


so that the slope of the tangent line is 
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m — /'(O) — m + 


r'( 0 )d( 0 ) — r( 0 )d'( 0 ) 
(d(Oj^ 


Then, the equation of the tangent line is 


(u , r (°)\ / , r'( 0 )d( 0 ) -r( 0 )d'( 0 )\ 

y~(b + -77777 ) = ( m +- 7377 ^-) (^ — 0 ) 


d( 0 ) 


(d( 0)) 2 


r( 0 ) /r'( 0 )d( 0 ) — r( 0 )d'( 0 )\ 

y — b + — + mi + - 7 ——: - x 

d ( o ) I ( d (°)) 2 / 


/r'( 0 )d( 0 ) — r( 0 )d'( 0 )\ r( 0 ) 

y - mi + b + - „ -I x + 


(rf(0)) 2 


d( 0 ) 


We want to find conditions on r(x) and d(x) so that 


r'(0)d(0)-r(0)d^(0)'' 


x + ^- 7 - is equal to the remainder. 
d( 0) M 


( d ( 0)) 2 

Since the function r(x) is linear, we can write r(x) = m 2 x + h 2 - For this to happen we need to solve the 
following system of equations 


(r'( 0 )d( 0 ) — r( 0 )d'( 0 ) 

Woyy 

r(0) h 

d( 0 ) = bz 


= m 2 


From the definition of r(x), we get r(0) = b 2 , so that the second equation above gives d(0) = 1. Substituting 
this into the first equation in the system gives 


r'( 0 )d( 0 ) — r( 0 )d'( 0 ) 

(dW 2 


= m 2 


m 2 - b 2 d'( 0 ) = m 2 


This last equation then says that d'(0) = 0. Knowing that d(x) is a polynomial with d(0) = 1, we know that 
the constant term must be 1. Knowing that d'(0) = 0, we know that the coefficient of the first degree term in 
d(x) must be 0. That is, the form of d(x) must be a k x k + a /c _ 1 x fe_1 + —h a 2 x 2 + 1. 

□ 

2x—l 

Example 2: The tangent line to /(x) = 3x + 5 + 3 4 2+1 , at x = 0 is y = 5x + 4. 

□ 


Corollary: The following is a direct result of the theorem. 


/(x) = mjX + + 


m 2 x + b 2 

a k x k + a k _ 1 x k ~ 1 + —h a 2 x 2 + 1 


/'( 0 ) — m 1 + m 2 
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+1 

Example 4: Show that x = 0 is a critical point of /(x) = — 7 -^, using only algebra. 


Solution: So that the function has the form in the corollary, we can factor out the 2 from the denominator, 


f(x) = 


X 3 + 1 1 / X 3 + 1 


* 2+2 2 4x’+i, 


X^+l 1 1 

Then, consider the function g (x) = T -, so that fix) = -g(x) and therefore fix) = -gfx). Using 

—x 2 +1 2 2 ^ 

2 

polynomial division on $ (x) gives 

— 2 x + 1 
gix) = 2x + - 

2* 2 + l 

From the corollary, we get $'(0) = 2 — 2 = 0 and therefore, /'(0) =--0 = 0. 

A Slight Generalization of the Above Technique 


□ 


Consider the function fix) = (x — 3) 2 . The point of tangency for the tangent line at x = 2 is (2,/(2)) = (2,1) 
and the slope is m — /'(2) = 2(2 — 3) = —2. Thus, the tangent line is y — 1 = —2(x — 2), which is yfx) = 
—2x + 5. Now, consider shifting the graph of fix) so that the point of tangency moves to the y-axis. To 
accomplish this, the point (2,1) shifts two units to the left to the point (0,1). That is, we consider the function 
gix) — fix + 2). Now find the tangent line to g (x) at x = 0. The function expression for g (x) is g (x) = 

(x — l ) 2 and the point of tangency is ( 0 ,,g( 0 )) = ( 0 , 1 ) and the slope of the tangent line is m — g'i 0 ) = 

2(0 — 1) = —2 so that the tangent line is y — 1 = —2(x — 0), giving y 2 (x) = —2x + 1. Finally, consider 
shifting the tangent line y 2 (x) two units to the right, 

y 2 (x — 2) = —2(x — 2) + 1 = —2x + 5 

The idea in the above discussion will be stated and as a theorem and proved next. We will then return to the 
idea of finding tangent lines to rational functions, in particular at locations off of the y-axis. 

Theorem 2 (Horizontal Shift of Tangent Fines): If gix) — fix + c) and yi(x) is the tangent line to g (x) at 
x = 0 then y 2 (x) = y 1 (x — c) is the tangent line to fix) at x = c. 

Proof: Suppose yi(x) is the tangent line to g (x) at x = 0. Then, 

Vi = mfx-xf + b t => y t = g'i0)ix-0) + giO) 

Since gix) — fix + c), then ,g(0) = /(c) and g\x) = /'(x + c), so that $'(0) = /'(0 -I- c) + /'(c), 

=* yi= f'ic)x + fic) 
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Then, y 2 (x) = y x (x — c) = /'(c)(x — c) + /(c), which is equivalent to 


y 2 -/(c) =/'(c)(x-c) 


This latter equation is the tangent line to /(x) at x = c. 

2^3 -j-x— 1 

Example 5: Find the tangent line to /(x) = ^ 2 ) z +i at X = w ' l h° ul computing a derivative. 


□ 


Solution: Since the point of tangency is not on the y-axis, we will shift the graph horizontally to the left so that 
the point of tangency will be on the y-axis. We will find the tangent line for the shifted function, then perform 
the opposite shift to the tangent line to obtain the tangent line to the original graph. That is, define g (x) = 

/(x + 2). Then, the expression for g (x) is 


g(x) = f(x + 2 ) = 


2 (x + 2) 3 + (x + 2) - 1 
((x + 2) - 2) 2 + 1 


2x 3 + 12x 2 + 25x + 17 
x 2 + 1 


Using long division to write the expression as a proper rational function gives, 

23x + 5 

ff(x) = 2x + 12+ , , i 


Then, the tangent line to g (x) at x = 0 is 

y 2 (x) = (2x + 12) + (23x + 5) = 25x + 17 

To obtain the tangent line to the original function /(x), at x = 2, we will shift the tangent line y 2 two units to 
the right. That is, 

y x (x) = y 2 (x — 2) = 25(x — 2) + 17 = 25x — 33 

□ 


As the previous example illustrates, we can apply the method of finding a tangent line to a rational function 


using the results of division, if the rational function has the form /(x) = 


p(x) 


(x-c) 2 +1 


, where the degree of p (x) is 


3. The most general case is the following 


fix) = 


Pjx) 
d(x — c) 


where, the degree of p(x) is one more than the degree of d(x) and d(x) has the form a k x k + a k - 1 x k 1 + —h 
a 2 x 2 + 1. 


In the following theorem, note that the first use of x = h refers to a specific x-value whereas the second use 
refers to a vertical line. 


Theorem 3: For a rational function fix) 


P (*) 
d(x)’ 


such that deg(p(x)) = 3 and d(x) has no real roots, we can 
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find the tangent line to /(x) at x = h, where x = h is the axis of symmetry of d (x), using only algebra. 
Proof: Writing d(x) in standard form, and using the fact that it has no real roots, we can write the following 


f(x) = 


Example 6: For the function, /(x) 


p(x) p(x) 1 

d(x) a(x — h) 2 + k k 


p(x) 


fU-h ) 2 + i 


□ 


x 2 +4x+5 


we can write 


/(x) = 


x 3 + 1 


x 3 + 1 


x 2 + 4x + 5 (x + 2) 2 + 1 


so that the tangent line can be found at x = —2, by first doing a horizontal shift. Similarly, consider the rational 


function p(x) = 


x 3 +l 

X 2 +4x+7 


, where we can write 


9 O) = 


x 3 + 1 


x 3 + 1 


X 3 + 1 


x 2 + 4x + 7 (x + 2) 2 + 3 3\^1 (x + 2)2 + 1 ^ 


For this function we can also find the tangent line at x = — 2. 


□ 


If we have a tangent line to a function g (x) at x = a and we know that g (x) and /(x) are related by g (x) = 
c/(x) then how do we obtain the tangent line to /(x)? The following theorem will state and prove the fact that 
if we stretch or compress a function, then we correspondingly stretch, or compress, the tangent lines by the 
same factor. 

Theorem 4: If p(x) = c/(x) and if y t is the tangent line to /(x) at x = a, then cy t is the tangent line to g (x) at 
x = a. 

Proof: The tangent line to /(a) at x = a, is y t — /'(a)(x — a) + /(a). The point of tangency on the graph of 
p(x) is 

( a,g(a )) and the slope is m = g\a), but with the relationship g (x) = c/(x), we can write this information as 
(a, c/(a)) and m — c/'(a), respectively. Thus, the tangent line to p(x) at x = a is 

y — cf(ja) = cf'{a){x —a) => y = c/'(a)(x — a) + c/(a) => y 

= c(/'(a)(x — a) + /(a)) 

□ 


The following example will illustrate Theorem 3 and Theorem 4. 


Example 7: Fet /(x) 
of the denominator. 


^3 _ i _^2 

—--. Find the tangent line to /(x) at x = h, where x = h is the axis of symmetry 

x +6x +13 
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Solution: Completing the square on the denominator gives the following for fix) 


X 3 + X 2 + X + 1 

= (x + 3 ) 2 + 4 

From the previous theorem, we see that we can find the tangent line to this function at x = —3, by first shifting 
the graph three units to the right. That is, we will define gix) = f(x — 3) and find the tangent line to g (x) at 
x = 0 . 


g O) = fix- 3) 


(x — 3 ) 3 + (x — 3 ) 2 + (x — 3) + 1 
((x - 3) + 3 ) 2 + 4 


x 3 — 8 x 2 + 22 x — 20 
x 2 + 4 

Factoring out the 4 from the denominator puts the function expression in the form we need, 


x 


3 - 8 x 2 + 22 x - 20 


L x 2 + 1 


Define h(x) — 


x*-8x A +22X-20 


-x z + l 


y 

, so that gix) — - h (x). Then, using polynomial division on h(x) gives 


h(x) 


= 4x — 32 + 


18x + 12 

l x2 + 1 


Thus, the tangent line to h(x) at x = 0 is y = (4x — 32) + (18x + 12) = 22x — 20. By a previous theorem, 

y 

the tangent line to g(x) at x = 0 is y = - ( 22 x — 20 ) and finally, by shifting three units to the left, the tangent 

4 

1 1 11 ?^ 
line to f(gc) at x = 3 is y — - (22(x + 3) — 20) = - (22x + 66 — 20) = —x + — 

□ 

In all of the preceding discussion, we focused on rational functions and certain values of the first derivative, the 
following theorem provides a basic result regarding a class of rational functions and a value of the second 
derivative. 


Theorem 5: For a rational function of the form, 


f(x) — mjX + b 1 + 


m 2 x + b 2 

a k x k + a k _ 1 x k ~ 1 + —I- a 3 x 3 + 1 


x = 0 will be a zero of f", but is not an inflection point. 
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Proof: To simplify the notation, we will write f(x ) = m^x + b 1 + where r(x) and d(x) are the functions 
stated above. The first derivative of /(x) is 


/'(x) = 7 % + 


r'd — rd' 


d 2 


Since r(x) is a linear function, we know that its derivative is m 2 . Substituting this is and taking the second 
derivative of /(x) gives 


/'(x) = 7 % + 


m 2 d — rd' 
d 2 


/"(*) = 


( m 2 d' — m 2 d' — rd")d 2 — ( m 2 d — rd') 2 dd' 
d 4 


Then, the second derivative evaluated at x = 0 is 


/"( 0 ) = 


(m 2 d'( 0 ) — m 2 d'(0) — r( 0 )d"( 0 ))d 2 ( 0 ) — (m 2 d( 0 ) — r( 0 )d'( 0 )) 2 d( 0 )d'( 0 ) 

d^O) 


We know the function values, r(0) = b 2 , d(0) = 1, d'(0) = 0 and d"(0) = 0, so that the above reduces to 

-b 2 d"( 0) 

f "( 0 ) = —=;—— = 0 
7 V 7 d 4 ( 0 ) 

This shows that x = 0 is a potential inflection point, but it will depend on the sign of /"(x) on either side of 
x = 0. Since the numerator is a polynomial, we can determine its degree. Also, since the denominator will be 
strictly positive, the sign is completely determined by the numerator. Consider the following function 

y = — rd"d 2 — 2dd'{m 2 d — rd') 


Let deg(d(x)) denote the degree of the function d(x). Since r(x) is linear, deg(r(x)) = 1. Then, the degree of 
the first term is 

(deg(d(x)) — 2 ) ■ 2 deg(d(x)) 

which reduces to 

>2 


2(deg (d(x))) — 4deg(d(x)) 


That is, the first term has even degree. The degree of the second term is 


deg(d(x)) ■ (deg(d(x)) - 1 ) ■ deg(d(x)) 


which reduces to 

(deg(d(x ))) 3 - (deg(d(x ))) 2 

which is also even. Thus, the numerator is an even degree polynomial and so does not change sign on either side 
of its zero. 
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Composition of piecewise functions and their domain 

Jeff VanHamlin 


I am going to present a method for determining the function expression for the composition of piecewise functions. To do 
this, lets first look at the case of evaluating a composition of piecewise functions at a value. Consider the piecewise 
functions 


fix) = { 


4x — 3, 
— 2 x + l, 


x < —1 
x > — 1 


and 


r — x — 7, x < 3 
~l 5x + 2, x > 3 

When evaluating a piecewise function at an input, the input restrictions determine which of the functions we are to use in 
the piecewise expression. If we want to evaluate the composition (/ ° g)( 4), we must first evaluate g{ 4) and then 
evaluate the function / at the output value of g (4). To do this we first determine which set or restriction the element 4 
belongs to in the g function. Since 4 satisfies the inequality x > 3, it belongs to that restriction set. Therefore, we use the 
corresponding function rule 5x + 2, giving 5(4) + 2 = 22. Now we must evaluate /(22). To do this we determine which 
restriction set that 22 belongs to in the / function. Since 22 satisfies the inequality x > —1, we will use the corresponding 
function rule —2x + 1. This will give us a final answer of —2(22) + 1 = —43. 

This can be condensed into the calculation below 


if ° gX 4) = f{g{ 4)) = /(5(4) + 2) = fill) = -2(22) + 1 = -43 

This gives us some insight into building a function rule for the composition. The composition depends largely on the 
restrictions. To evaluate an input, x, for the composition we first look at the restrictions on the g function. We will use the 
corresponding function rule depending on what restriction set the value of x satisfies. To determine what function rule of 
/ is used after the evaluation of g, we look at the set the output value g{x) satisfies. 

The above calculation of /(^(4)) used the function rule was —2(5x + 2) + 1 evaluated at x = 4. We used the 
composition of the g(x ) = 5x + 1 with /(x) = —2x + 1 because x = 4 is in the restriction set of x > 3 and $(4) = 22 
is in the restriction set of x > — 1. So we must look at the intersection of the sets. 


Function expression for the composition of piecewise functions 


Using the same functions, we determine the composition expression of the piecewise functions. Given 


fix) = { 


4x — 3, 
— 2 x + 1 , 


x < — 1 
x > —1 


and 
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g O) = { 


—x — 7 , 
5x + 2, 


x < 3 
x > 3 


To determine the composition, we will start with the restriction sets of g(x). 

Starting with the top restriction set of the g function. If x < 3, there are two possible restriction outcomes of the 
function rule g(x) — — x — 7. Either the outputs will be g(x) < — 1 or g(x) > — 1. This means we must solve 
these inequalities for both of piecewise expressions of g. 

Solving g(x) < — 1, gives — x — 7 < — 1 => x> —6. 

Since we are looking at the x-values that satisfy x < 3, we will use the expression —x — 7 in g. Also since 
g (x) < —1, we will use the expression 4x + 3 in /. Therefore, we will compose these two expressions at the 
intersection of the sets x < 3 and x > — 6. Resulting in the composition 

4(—x — 7) + 3, —6 < x < 3 

Solving g(x ) > —1, gives — x — 7 > —1 => x < —6. 

Since we are looking at the x-values that satisfy x < 3, we will use the expression —x — 7 in g. Also since 
g(x) > —1, we will use the expression —2x + 1 in /. Therefore, we will compose these two expressions at the 
intersection of the sets x < 3 and x < — 6. Resulting in 

—2(—x — 7) + 1, x < —6 

Now we must repeat this process for the next restriction set of g. If x > 3, again there are two possible 
restriction outcomes for the expression g (x) = 5x + 2. Either the outputs will be g (x) < — 1 or g (x) > — 1. 
This means we must solve these inequalities for both of piecewise expressions of g. 

3 

Solving g(x) < —1, gives 5x + 2 < —1 => x < — 

Since we are looking at the x-values that satisfy x > 3, we will use the expression 5x + 2 in g. Also since 
g(x) < —1, we will use the function expression 4x + 3 in /. Therefore, we will compose these two expressions 

3 

at the intersection of the sets x > 3 and x < —. Notice that the intersection of these sets is the empty set. 
Therefore, the composed expression 4(5x + 2) + 3 will not be used. 

3 

Solving g(x) > —1, gives 5x + 2 > —1 => x > — -. 

Since we are looking at the x-values that satisfy x > 3, we will use the expression 5x + 2 in g. Also since 
g(x) > —1, we will use the expression —2x + 1 in /. Therefore, we will compose these two expressions at the 

3 

intersection of the sets x > 3 and x > — -. Resulting in 

—2(5x + 2) + 1, x > 3 
The final expression for the composition becomes 
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2(—x — 7) + 1, x < —6 

(/ ° ,g)(x) = | 4(—x — 7)+ 3, 6<x<3 

( —2(5x + 2) + l, x>3 


f 2x + 15, 
j-4x-25, 
l —lOx — 3 


x < —6 
6 < x < 3 
x > 3 


To generalize this technique. Given i,j, n,m e N, 0 < i < m, 0 < j < n and A L is the domain of ft and B L is 
the domain of gj . Given the functions 


and 


g(x) = 


7i. 

xeA ± 

h. 

xeA 2 

.fm, 

X£ A m 

r 9i, 

xeB 1 

92, 

xeB 2 

<9n, 

xeB n 


Then the composition of the functions is 


r fi°9i. X£ B 1 n g{x)eA 1 
fi ° 9 2 > X£ B 2 n g(x)eA 1 


(/ °gXx) = 1 


fi°9n, X£ B n n g(x)eA 1 
fz 0 9i, xeB 1 n g(x)eA 2 
f 2 °g 2 , xeB 2 ng(x)eA 2 

f 2 og n , xeB n n g(x)eA 2 


= {ft ° 9j 


xeBj n g(x)eAi 


fm°9 1 , xeB 1 n g(x)eA n 
fm ° 92’ X£ B 2 ng(x)eA n 


Vm ° 9n, X£ B m n g(x)eA n 


If xeBj n g(x)eA L is the empty set, exclude the composed function expression ° gj from the composition. 
This also means the amount of piecewise expressions in the composition will be at most the product of m and 
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Domain of a composition 


if 


and 


/(*) 


x — 2 

x + 3’ 
, 3x + 1, 


x < 0 
x > 0 


g(x) 


Vx + 5, x < 1 
x 2 — 4, x > 1 


The domain of a composition is defined by 

Domif o g) — {x e Dom(g) n ( Rng(g) e Dom{f )} 

First, we must find the domain of g. This can be found by determining the domain of the piecewise expressions 
on their corresponding restrictions. 

For the expression Vx + 5 we know that x + 5 > 0 on the restriction of x < 1. Therefore, the function rule 
Vx + 5 is used for the x-values at the intersection of x > — 5 and x < 1 which is the set [—5,1). Since the 
expression x 2 — 4 is defined for all x, we will use the expression x 2 — 4 for all values on the set x > 1 which is 
the set [1, oo). This means the domain of g is the set [—5,1) U [1, oo). Therefore Domig) — [—5, oo). 

The domain of / is found similarly. 

x —2 

For the expression —, we know that x + 3 =£ 0, so x =£ — 3 on the restriction x < 1. Since x = — 3 is an 

element of the set x < 1 it must be excluded from the domain of /. This is the set (— oo, —3) U (—3,1) The 
second expression of / is 3x + 1 and it is defined for all real numbers on the restriction x > 1. This is the set 
[1, oo). Therefore domif) — (—oo, — 3) U (—3,1) U [1, oo) = (— oo, —3) U (—3, oo). 

Now we must determine when gix) is an element of the domain of /. This is done by determining when the 
output of the g function is an element of the domain of the / function. 

Rngig) e Domif) 

When determining Rngig) e Domif) we investigate both g expressions Vx + 5 and x 2 — 4, we must look at 
the invalid input of —3 for the domain of /. We determine any value of x when evaluated g gives an output of 
—3 and exclude those values. 

Vx + 5 ^ —3 -> infinite solutions — (—00,00) 

The expression Vx + 5 is always non-negative, giving the solution of all real numbers. 
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x 2 — 4 =£ — 3 -> x 2 =£ 1 


X =£ ±1 = (— 00 , —1) U (—1,1) u (1, oo) 


When using the expression Vx + 5, we know it is restricted to the values x > 1, therefore we need only exclude 
x = 1 from the domain of the composition. 

Note, Dom(g) — [—5, oo) and Rng(g) e Dom(f ) when x e (—oo, 1 ) u (1, oo). The intersection of these two 
sets give the domain of the compostion. 

Therefore dom(/ ° g) — [—5,1) U (1, oo). 

To verity this is correct, we will find the function expression for this composition and verify the domain found 
is correct. 

Solve all g (x) expressions for the restrictions of / and determine their intersections with the restrictions of g. 

Vx + 5 <0 -> no solution 

'Jx+5—2 

This means the composed expression , — will not be used. 

r r Vx+5+3 

Vx + 5 >0 -» x + 5 > 0 x > — 5 

Now the intersection of x > — 5 and x<lis— 5 < x < 1. Giving the composed expression 3 Vx + 5 + 1 

on the restriction of — 5 < x < 1. 


x 2 — 4 < 0 — 2 < x < 2 

The intersection of —2 < x < 2 and x>lisl<x<2. Giving the composed expression 
= ^ 7 -^ on the restriction of 1 < x < 2 . 

O z -4)+3 X 2 ~l 

x 2 — 4 > 0 x < —2 or x >2 

The intersection of x < — 2 or x >2 and x > 1 is x > 2. Giving the composed expression 
3(x 2 — 4) + 1 = 3x 2 — 11 on the restriction of x > 2. 

Therefore 

{ Vx + 5, — 5 < x < 1 

X —^, 1 < x < 2 

x^ — 1 

3x 2 — 11, x > 2 

Notice the domain of the function can be calculated by determining the domain of each expression on their 
corresponding restrictions. 

Vx + 5 -> x + 5>0 -> x > — 5 
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This is only for the restriction of — 5 < x < 1. 

x 2 — 6 

—- x 2 — 1 =£ 0 -> x =£ +1 

x 2 - 1 

This is only for the restriction ofl < x < 2, sox ^ - lis the only value excluded. 

3x 2 - 11 


The domain of this expression is all real numbers. 
Therefore 


dom(f o g) = [—5,1) U (1, oo). 


Exercises: 


1. Given the piecewise functions 


and 


fW = { 

9(x) = { 


—5x + 1, 
x — 4, 

3x — 2, 
—2x + 3, 


x < —2 
x > — 2 

x < 1 
x > 1 


a. Determine the value of / (g (—3) ). 

b. Determine the value of / (g (4) ). 

c. Determine the function expression for the composition /(^(x)). 

d. Use the function expression found in c. to verify exercises a. and b. 


2. Determine the domain of the composition f(g(x)) given the piecewise functions 


(— 5x + 1. 

x < 0 

>< 

II 

<U 

* 

1 

x > 0 

( x — 5 


■'f 

1 

II 

2 

CD 

x < 2 

l x 2 + 7, 

x > 2 


3. Determine the function expression for the piecewise function in exercise 2. Use the function expression to verify 
the domain. 
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The Arithmetic Derivative 


Paul Bedard 
Introduction 

One of the best tricks I know for understanding a mathematical concept more fully is to apply the 
concept inappropriately. That is, to use a rule or an algorithm in a situation never intended for its use. Whoever 
first put chocolate on a pretzel understood this. Many of the great mathematical discoveries have come about 
through the creative misapplication of ideas. Indeed, without a great deal of flexibility in the matter of what we 
permit ourselves to think about, modem mathematics as we know it wouldn’t exist. (WE must not forget, 
however, that some nonsense is truly and simply nonsense. When playing in the sandbox, the refusal to cling to 
traditional architectural forms must be accompanied by a willing fist, ready to quickly obliterate grotesqueries.) 

Against this notion, Simon Stevin said of complex numbers in 1585, “There is enough legitimate matter, 
even infinitely much, to exercise oneself without occupying oneself and wasting time on uncertainties.” (Swetz, 

p. 138). But Leibniz, also discussing the same subject, said “One would never have believed that V1 + V—^ + 

V1 — V— 5 make and there is something hidden therein which is incomprehensible to me.” (Swetz, p. 138). 

Finding such hidden somethings in apparently nonsensical or meaningless applications (such as applying 
the square root concept to negative numbers) is a fmitful source of new mathematics. But it is also a way to get 
a better grasp of the old. 

In this essay I will show how the product rule for derivatives can be applied to integers, creating a 
sequence of integers referred to as “arithmetic derivatives.” If you have no familiarity with calculus, please 
read on nonetheless. Though the first part of the essay, in which I review the product rule as used in calculus, 
might not mean much, the second part can be understood readily by anyone who can multiply and add whole 
numbers. You might achieve an appreciation of a calculus technique without ever knowing what calculus is! 


The Leibniz Rule 


The Leibniz rule, often called the product rule, is a rule intended to facilitate the differentiation of 
products of differentiable functions. The rule, stated in Leibniz or differential notation, is 


d dv du 

— (uv) = u — + v — 
dx dx dx 


Here it and v are differentiable functions of x. The right side of the definition may be read: 

“The first times the derivative of the second, plus the second times the derivative of the first. ” 

(I can’t neglect to point out that the most common student error when taking derivatives of products, what I call 
the “first false product rule,” - the false idea that the derivative of a product is the product of the derivatives - 
was an error first made by Leibniz himself. However, as I tell my students often, making errors no matter how 
silly is not a problem. The problem is failing to recognize and correct them.) 
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Example 1. A calculus derivative with and without the Leibniz rule. 

Find the derivative of y — (2x + l)(3x — 1) 

We may do this in either of two ways. Multiplying the binomials before differentiating reduces the problem to 

d 

— (6x 2 + x — 1) 
dx 

Applying the power rule (and the sum rule) quickly yields 

d 

— (6x 2 + x — 1) = 12x + 1 
dx 

We don’t need the product rule at all. But it is instructive to see how it works here. 

(2x + l)(3x - 1) = (2x + 1) (3x - 1) + (3x -!)■$- (2x + 1) 

dx dx dx 

Now we can apply the power rule to find each of the derivatives and this leads to 

(2x + 1) ■ 3 + (3x — 1) ■ 2 = 6x + 3 + 6x — 2 — 12x + 1 

There is nothing that builds the confidence in a new rule as much as seeing it yield the same answer as a trusted 
old rule! 

The power rule, which we will see again in a new context later, applies to derivatives of power functions of x. 

The Power Rule for (Calculus) Derivatives 



- nx n 


i 


The Arithmetic Derivative 


“The arithmetic derivative n' was introduced, probably for the first time, by the Spanish mathematician 
Jose Mingot Shelly in June 1911 with "Una cuestion de la teorfa de los numeros", work presented at the "Tercer 
Congreso Nacional para el Progreso de las Ciencias, Granada.” 

(From The Online Encyclopedia of Integer Sequences, https://oeis.org/A003415). 

The calculus derivative of any constant function is identically equal to zero. This means that the 
derivative of a simple number, say 5, when thought of as a function /(x) = 5, is zero. The arithmetic derivative 
is not a calculus derivative. We are going to define a new function on the integers and call it an arithmetic 
derivative simply because it reproduces some properties of calculus derivatives. In fact, we are going to define 
this function in terms of one of these properties - the Leibniz rule. 

We want the arithmetic derivative of a positive integer to be another positive integer, and we want the 
Leibniz rule to be satisfied. 
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Definition. The arithmetic derivative of a positive integer N is a unique positive integer AT such that 

(1) If N is prime then N' — 1, 

(2) If N — a ■ b for any positive integers a and b then N’ — ab' + ba 1 
(N' may be read as “N prime”; prime notation here is borrowed from calculus.) 

There are some things we must note about this definition. First, it doesn’t directly construct arithmetic 
derivatives, except in the case of primes. In other words, it provides no formula for calculating an arithmetic 
derivative. Yet, it is a perfectly acceptable definition. We will examine why this is so. 

First, we should ask ourselves whether N', defined this way, will be unique. In other words, is this a 
function (just one output for each input) on the positive integers at all? If an integer can be shown to have two 
different arithmetic derivatives, both of which satisfy the definition, then we have a problem. And in fact, it is 
reasonable to be concerned. Notice we said “if N = a ■ b for any positive integers a and b”. Obviously, many 
integers can be factored into a product of two other integers in many ways. 

Example 2. Multiplicity of representations. 


120 = 1 ■ 120 = 2 ■ 60 = 3 ■ 40 = 4 ■ 30 = 5 ■ 24 = 6 ■ 20 = 8 ■ 15 = 10 ■ 12 


In order for our definition to yield a unique arithmetic derivative for 120, the values of the arithmetic 
derivatives of all of these factors must be such that no matter which factoring we choose, when we apply the 
Leibniz rule, we get the same answer. 

Let’s start by stating a theorem. 

Theorem 1. V — 0 

Proof. Let N be any positive integer. Then N = N ■ 1 and therefore N' = N ■ 1' + 1 • N'. 

Subtracting N' from both sides yields 0 — N ■ 1'. By the zero product property of multiplication (which says 
that, if a product equals zero, at least one of the factor must equal zero), since A A 0 then V — 0. 


When we find a general rule for calculating arithmetic derivatives of any positive integer, we will see 
that the arithmetic derivative of a number is the same no matter how we factor the number. 

Calculating Arithmetic Derivatives 

Let’s begin to create a table of arithmetic derivatives. It will be very incomplete at this point. 

Table 1 (a) 


N 

N' 


N 

N' 

1 

0 

9 


2 

1 

10 


3 

1 

11 

1 

4 


12 


5 

1 

13 

1 

6 


14 


7 

1 

15 


8 


16 
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We have filled in the arithmetic derivative of 1, from theorem 1, and the arithmetic derivatives of the primes 
listed, which are equal to 1 by definition. 

Can we find the arithmetic derivative of a composite number? If the composite number is a product of 
two primes, we surely can. Let’s find 6 '. 

Example 3. The arithmetic derivative of a product of two primes. 

6 ' = (2 ■ 3)' = 2 ■ 3' + 3 ■ 2' = 2 ■ 1 + 3 ■ 1 = 5 

Can we come up with any more general formula from this example? We can! 

Theorem 2. If N — p ■ q where p and q are primes, then N' — p + q. 

Proof. Let N — p ■ q where p and q are primes. 

Then N 1 = (p ■ q)' = p ■ q' + q • p' = p ■ 1 + q ■ 1 = p + q ■ 


Don’t worry - the proofs will get less trivial. 

From theorem 2, we can determine the arithmetic derivatives of 10, 14, and 15. 

Example 4. More arithmetic derivatives of products of two primes. 

10 ' = 2 + 5 = 7, 14' = 2 + 7 = 9, 15' = 5 + 3 = 8 
Let’s add these to our table. 

Table 1 (b) 


N 

JV' 


N 

N' 

1 

0 

9 


2 

1 

10 

7 

3 

1 

11 

1 

4 


12 


5 

1 

13 

1 

6 

5 

14 

9 

7 

1 

15 

8 

8 


16 



So far, we have a rather nifty idea - a function which assigns to composite numbers which can be factored as 
the product of two primes, the sum of those primes. But what about a number which is a product of two 
nondistinct primes - for instance, 9 which is the product of 3 and 3. The theorem should work for such 
numbers also. 

Example 5. The arithmetic derivative of a prime squared. 

9 ' = (3 ■ 3)' = 3 + 3 = 6 
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But what about number which are the product of more than two primes? Like 8 which is 2 3 (the product of 
three primes, all the same) and 12 which is 2 2 ■ 3? And what about 16 which is 2 4 ? 

The next theorem will help us with 8 and 16. We are slowly building to a general formula, never fear. 

Theorem 3. If N = p n where p, n are positive integers and p is prime, then N' — np n ~ 1 . 

Proof. Let N — p n where p, n are positive integers and p is prime. 

If n — 1, then the theorem holds since N' — p' — 1 by definition, and by the theorem 

(p 1 )' = 1 ■ p 1_1 = 1 ■ p° = 1 . 

We will proceed by induction. Suppose the theorem holds for n = k — 1. Then for 
N = p k ~ 1 ) N' — (k — 1 )p k ~ 2 . 

Suppose N — p k . Then N' — ( p k )' = (p ■ p^ -1 )' = p ■ (p fc_1 )' + (p^ -1 ) ■ p' 

Since (p^ -1 )' = (k — 1 )p k ~ 2 and p' = 1, we get 

N' = p • (k — 1 )p k ~ 2 + (p k ~ 1 ) ■ 1 = /cp^ -1 . ■ 

Did you notice anything special about theorem 3? It looks a lot like the power rule for calculus 
derivatives! Let’s take a moment to reflect on what has happened here. We started by defining a new kind of 
derivative, not based at all on calculus derivatives except that we insisted on the Leibniz rule. There was no 
particular reason to expect arithmetic derivatives to share anything else in common with calculus derivatives. 
But we now see that the power rule has come along for the ride! 

What does this tell us about the relationship between the power rule and the Leibniz rule? Already, our 
outside-the-cube thinking has had positive results. We are tricking ourselves into examining relationships 
within “real” mathematics more closely! 

Example 6. Arithmetic derivatives of primes to any power. 

4 ' = (2 2 )' = 2 ■ 2 1 = 4; 16' = (2 4 )' = 4 ■ 2 3 = 32. 

Exercise 1. Find 8 ' both by factoring 8 into 2x4 and by applying theorem 3. 

Now we can fill in the table a little more. 

Table 1 (c) 


N 

N' 


N 

N' 

1 

0 

9 

6 

2 

1 

10 

7 

3 

1 

11 

1 

4 

4 

12 


5 

1 

13 

1 

6 

5 

14 

9 

7 

1 

15 

8 

8 

12 

16 

32 
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A General Formula for Arithmetic Derivatives (I) 


Our next task is a bit more daunting. We would like to create a formula which will give us the arithmetic 
derivative of any positive integer, not just a power of a prime or a product of two primes. 

To accomplish this, we need two further facts. One is the extended Leibniz rule for calculus derivatives. 
Suppose we have a function which is a product of an arbitrary number of differentiable functions. 

Extended Leibniz Rule for calculus derivatives. 

d (d \ (d \ (d \ 

— (u^Uz ...U k ) = ( — Ui) u 2 U 3 ...U k + % I —U 2 1 U 3 ...U k + ••• + 11^2^3 ...(—J 

To understand what is going on in this rule, look carefully at each term. Each term is the product of all 
but one of the functions and the derivative of that missing function. The derivative can be thought of as moving 
like a wave through the products. 

We present the extended Leibniz rule without proof, but we will prove the arithmetic derivative version 
and this will give a solid hint for a proof of the calculus version. 

Theorem 4. Extended Leibniz Rule for Arithmetic Derivatives 

Let N — a 1 a 2 CL 3 ... a n , for positive integers a*. Then 

N' = (a 1 )'a 2 a 3 ... a n + a 1 (a 2 )'a 3 ... a n + a 1 a 2 (a 3 )'... a n + ...+ a x a 2 a 3 ... (a n )' 

Proof. Again, we will use induction. Lor n — 1, we have N' — (%)' and the theorem is 
satisfied. 

Assume the theorem is true for n — k — 1. Then for M = a 1 a 2 a 3 ... a k _ 1 , 

M' — ...a^ + a 1 (a 2 ya 3 ...a k _ 1 + ...+ a x a 2 a 3 ... (*) 

Now for any positive integer a k , let N = a x a 2 a 3 ... a fc _ 1 a fc . 

Then AT = ( a t a 2 a 3 ...a k _ 1 a k )' 

But by the definition, 

(a^ag = a x a 2 a 3 ...a /c _ 1 ■ (a fc )' + a k ■ (a x a 2 a 3 ... a k _ x y 

Substituting (*) we get 
(a!a 2 a 3 ... a^aj' = 

a!a 2 a 3 ... a k _ t ■ (a k )' + a k ■ ((.a i ya 2 a 3 ... a fc _ x + ... a fc _ x + ... + a!a 2 a 3 ... 

Distributing a k gives us 

N' = (di)'a 2 a 3 ... a k + a 1 (_a 2 ya 3 ...a k + ...+ a ± a 2 a 3 ... (a k y. u 

Exercise 2. Lind the arithmetic derivative of210 = 2- 3- 5- 7 

Combining this extended rule with our previous rules will complete our task of finding a general formula for the 
arithmetic derivative of any positive integer. Before we state it we need to recall the fundamental theorem of 
arithmetic. Every positive integer has a unique prime factorization. 
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The Fundamental Theorem of Arithmetic 


For any positive integer N there exists a unique factorization N — p 1 ni p 2 n2 p 3 n3 ... Uk 
where p t , n t are positive integers and p, is prime. 

For example, 12 = 2 2 ■3 and 72 = 2 3 ■ 3 2 . The order of the primes does not matter (sometimes we say “every 
positive integer has a unique prime factorization, up to order” to be clearer.) It is customary to list the primes in 
increasing order. If 1 were considered a prime number (it’s not) the fundamental theorem would be invalid 
since we could select any power for the 1 and the factorization would not be unique. 


A Combinatorial Digression 


Before we present our main theorem, the formula for the general arithmetic derivative, I want to point 
out that theorem 4 links the topic of arithmetic derivatives with the topic of combinatorics, which, loosely 
defined, is the branch of mathematics that deals with counting finite collections of objects. 

Example 7. Selecting a committee. 


Given ten people, how many ways are there to select a set of 6 of them? The order of the people selected does 
not matter - suppose that you are forming a committee. 


The answer to this question introduces a concept called combinations. We write 



and read this as “10 choose 6 .” The formula for evaluating this expression involves factorials. Factorials are 
written with an “!”; for example, 5! = 5- 4- 3- 2- l. 


Formula for calculating number of combinations when k objects are selected from a set of n objects, k < n. 

( n \ _ n - 
\k) k\(n — k)\ 

So the solution to example 7 is 

/10\ 10! 10-9-8-7-6 ! 10 ■ 9 ■ 8 ■ 7 

I 1 =-=-=-= 210 

V6? 6! (10-6)! 6 !-4! 4-3-2-1 

How does this tie in with arithmetic derivatives? The next example should clarify this. 

Example 8. Arithmetic derivatives and combinations. 


Find the arithmetic derivative of 2 ■ 3 ■ 5 

Using theorem 4, we see that (2 ■ 3 ■ 5)' = 2' ■ 3 • 5 + 2 ■ 3' ■ 5 + 2 ■ 3 ■ 5' 

But since 2,3,5 are all prime and thus have arithmetic derivatives of 1, this simplifies to (2 ■ 3 ■ 

5)'= 3-5+ 2-5+ 2-3. 
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Look at this result carefully. Note that the arithmetic derivative of a product of three prime factors 
turned out to be every possible product of two of those factors. There turned out to be th5ree such products. 
Counting the products is a matter of using the combinatorial formula for combinations. 

Recall the exercise above, when you found 210'. 210 = 2 • 3 ■ 5 ■ 7. We needed to find every possible product 
of three of those factors. 

an 


For any integer which is a product of n distinct prime factors, the arithmetic derivative will be the sum of 

(„ -1) p roduc(s - 

Example 6. Number of distinct products. 

60' = (2 ■ 2 ■ 3 ■ 5)' = 2' ■ 2 ■ 3 ■ 5 + 2 ■ 2' ■ 3 ■ 5 + 2 ■ 2 ■ 3' ■ 5 + 2 ■ 2 ■ 3 ■ 5' 

This is equal to2(2-3-5) + 2- 2- 5 + 2- 2- 3 

There are ( ) products but only 3 distinct products. How can we count the distinct products? 

Exercise 3. In calculating the arithmetic derivative of 3 4 ■ 5 2 ■ ll 3 , how many products would there be? How 
many distinct products would there be? 


A General Formula for Arithmetic Derivatives (II) 


Now let’s return to our main theorem. We have seen how the extended Leibniz rule allows us to take an 
arithmetic derivative of a product with an arbitrary number of factors. We have also seen that every positive 
integer can be written as a unique product of primes to powers. 

Theorem 5. General formula for arithmetic derivatives. 

Let N be a positive integer with prime factorization p 1 ni p 2 nz P 3 ri3 ■■■ Pk nk • Then 
N' = iVCrtipr 1 + n 2 p 2 ~ 1 + ■■■ + nfcPfc -1 ) 

Proof. 

AT = (Pi ni )'P2 n2 P3 n3 -Pk nk + Pi ni (P2 n2 )'P3 n3 -Pk nk + •"+ Pi ni P 2 n2 P3 n3 - (Pk nk Y 

— n 1 p 1 ni_1 p 2 Tlz P 3 Tl3 — Pk nk + Pi ni n 2 P2 n2 ~ 1 P3 rl3 — Pk nk +-f Pi ni P2 n2 P3 n3 — n kPk k l 

We factor out p 1 ni p 2 n 2 p 3 Jl3 ... p k Uk which gives us + n 2 p 2 _1 + —I- n k p k ~ 1 ') ■ 
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Example 7 Find 12'. 

12' = (2 2 ■ 3)' = 12 (| + i) = 12 Q = 16 


Finally, we can complete our table of the first 16 arithmetic derivatives: 

Table 1 (d) 


N 

N' 


N 

N' 

1 

0 

9 

6 

2 

1 

10 

7 

3 

1 

11 

1 

4 

4 

12 

16 

5 

1 

13 

1 

6 

5 

14 

9 

7 

1 

15 

8 

8 

12 

16 

32 


Exercise 5. Find the arithmetic derivative of 2 • 3 4 ■ 5 ■ 7 2 = 39,690 


Second Arithmetic Derivatives 


We will define a second arithmetic derivative just as we do with calculus derivatives - it will be the 
arithmetic derivative of an arithmetic derivative, and will be denoted N", which we can read as “N double 
prime.” 

We can partially fill in a table of second derivatives by taking the entry in the N' column and looking 
for that number in the N column. For instance, consider N — 6 as noted above in boldface. 6' = 5, so we find 
5 in the N column and note that 5' = 1. So 6" = 1. 

By doing this as often as we can, let’s begin a table of first and second arithmetic derivatives. 

Table 2 


N 

N' 

N " 


N 

N' 

JV" 

1 

0 


9 

6 

5 

2 

1 

0 

10 

7 

1 

3 

1 

0 

11 

1 

0 

4 

4 

4 

12 

16 

32 

5 

1 

0 

13 

1 

0 

6 

5 

1 

14 

9 

6 

7 

1 

0 

15 

8 

12 

8 

12 

16 

16 

32 



This was pretty successful! Note that 1" currently lacks any defined answer, since we have defined 
arithmetic derivatives as a function on positive integers. We will also feel this lack if we try to find third or 
higher order arithmetic derivatives, since for any prime p, p" — 0. We had better follow convention and define 
0 ' = 0 . 
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Let’s find more comparisons between calculus and arithmetic derivatives. A power function is said to 
have (calculus) derivatives that “vanish,” because for sufficiently high order, the derivative of that order is zero, 
and since the calculus derivative of zero is zero, any derivative of higher order is zero as well. 

Example 8. A function whose calculus derivative vanishes. 


f{x) — x 3 ,/' (x) = 3x 2 ,/"(x) = 6x,/"'(x) = 6,f^\x) — 0 


Since the derivative of 0 is 0, every derivative after the fourth is also equal to zero. So we can state: 

(x 3 ) = 0,n > 4 

We can see from the table that arithmetic derivatives may vanish also. In fact, the arithmetic derivatives of 
primes vanish at the second derivative. 10" = 1, so the arithmetic derivative of 10 vanishes at the third 
derivative. 


Arithmetic Differential Equations 


In calculus, an equation such as y' — 2x can be solved by finding a function whose derivative is 2x. 
This is called a differential equation because it contains derivatives. Differential equations can become much 
more complicated: y" + 2y' + y — 0 is one example. Students of calculus learn a large variety of methods for 
finding functions which are solutions to such equations. 

General methods for solving arithmetic differential equations would be very different from calculus 
methods. The analogy breaks down badly here. I am not aware of any work that has been done on such 
methods. However, it is interesting and instructive to look at some simple examples. 

Example 9. An arithmetic differential equation. 

Solve the equation x' = 14. 

We have seen that numbers which are the product of two distinct primes have arithmetic derivative which is the 
sum of those primes. (Theorem 2). Let’s look at all the ways that 14 can be written as a sum of positive 
integers. 

14 = 1 + 13 = 2 + 12 = 3 + 11 = 4+10 = 5 + 9 = 6 + 8 = 7 + 7. 

3 and 11 are primes, so their product 33 has arithmetic derivative 3 + 11 = 14. 7 is also a prime so 49 is 
another solution. We have no reason to conclude that we have found all solutions. Unlike the situation with 
calculus, finding some solutions doesn’t make finding further solutions any easier. 
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Example 10. Another differential equation. 

Solve x' — x 

In calculus, there is a set of functions which are their own derivatives - multiples of the exponential function 
fix) — e x . 

Examining table 2, we can see that 4 is its own arithmetic derivative. That is the only solution on our table. 
However, perhaps we can find a pattern and generate some more solutions. 

4 = 2 2 . Recall theore 3, the “power rule” for arithmetic derivatives. Now consider any integer of the form 
p v ,p prime. Its arithmetic derivative would be p ■ p p ~ 1 — p p . 

Thus, all integers of the form p p , p prime are solutions to x' — x. 

24 and 256 are additional examples. 

I invite readers to propose and solve further arithmetic differential equations. We can always reverse engineer 
them from the answers in the table (for example, since 3' = 1, we can create 
N = 3 N'. Indeed, p is a solution to N — pN' for any prime p.) Try to do better. 


Some Differences Between Calculus and Arithmetic Derivatives 


We have seen that arithmetic derivatives, which obey the Leibniz rule by definition, also have a version 
of the power rule. They can be extended to derivatives of higher order, and they can vanish or remain constant, 
just like calculus derivatives. However, they differ in almost every way. We must not confuse our use of the 
term “derivative” for arithmetic derivatives to convey any more than a superficial, if pleasing and sometimes 
surprising, comparison. 

One of the most helpful and basic rules for calculus derivatives is the sum rule: 


d df dc/ 

dx ^ + ^ dx + dx 


This rule does not hold for arithmetic derivatives. Note that 30' = 31 =£ 25' + 5'. 

Another difference is more fundamental. Calculus derivatives give us formulas for slopes of tangent 
lines to curves. We have not discovered any useful facts about numbers that their arithmetic derivatives 
provide. Not yet! Perhaps further investigation of this intriguing if highly artificial concept may yet lead to 
some good mathematics. However, as I hope I have shown, even if it doesn’t, the investigation is worthwhile 
because it helps us think logically, it helps us focus on the properties of derivatives, and it is intrinsically 
intriguing. 


Postscript: An Alternative Arithmetic Derivative 


In the definition of arithmetic derivative, we specified that p' = 1 for any prime. We could not prove 
this if we had not stated it. It is not implied by the Leibniz rule, but it was convenient for us. 

Suppose we had said, for instance, p' = 2 for any prime. This would give us another arithmetic 
derivative altogether. Let’s see if it would work. We will use the notation N~ for arithmetic derivatives of this 
type. We need to see if N~ is unique for every N. 
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Example 11. Uniqueness of an alternate derivative. 


12 = 2-6 = 3-4 

6~ = (2 ■ 3)~ = 2 ■ 3~ + 3 ■ 2~ = 2 ■ 2 + 3 ■ 2 = 10 
4~ = (2 ■ 2)~ = 2 ■ 2~ + 2 ■ 2~ = 2 ■ 2 + 2 ■ 2 = 8 

12~ = (2 ■ 6)~ = 2 ■ 6~ + 6 ■ 2~ = 2 ■ 10 + 6 ■ 2 = 32 
12~ = (3 ■ 4)~ = 3 ■ 4~ + 4 ■ 3~ = 3 ■ 8 + 4 ■ 2 = 32 

What if we defined a derivative, say N*, such that p* — p for any prime p. Would this work? Let’s let 
this be your final exercise. 

http://math.arizona.edu/~ura-reports/063/Sandhu.Aliana/Final.pdf 
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Modeling Biological Processes with 
Discrete Time Markov Chains (DTMC) 

Janice Fritz 

Introduction 

I plan to investigate the use and limitations of Discrete Time Markov Chains (DTMC) in modeling biological 
processes. DTCM depend on stochastic matrices and they are used to model processes in which the future 
depends only on the current state, not on the past. I've found several examples of DTMC-based models in 
Biology, including cell proliferation, cell migration, birth-death processes, enzyme kinetics, and DNA 
sequences. I hope to design my own simple model of a biological process (possibly cell differentiation, but I'm 
not willing to commit to that at this point) with a stochastic matrix that I can use to predict the result of the 
process. This is what I would demonstrate on my mathematics page. 

Modeling Biological Processes with Discrete Time Markov Chains (DTMC) 

Markov processes are named after Russian mathematician Andrey Markov (1856-1922) [1], Markov, whose 
younger brother and son were also notable mathematicians, worked on stochastic processes [1]. Stochastic 
processes involve random variables that change over time (or some other parameter) based on some probability 
[2, 4]. A Markov process is a type of stochastic process in which the outcome at any point depends only on the 
previous state and not on how the system arrived at that state [3]. It also requires that the number of states is 
finite and probabilities are constant [4]. 

As an example of a situation obeying the rules of a Markov process, consider the hypothetical intersection of 
two one-way streets, Elm St. and North St., at which 45% of vehicles approaching on Elm St. go straight and 
55% turn onto North St.. Of the vehicles approaching the intersection from North St., 75% go straight and 25% 
turn onto Elm St.. In this situation, the probability of a car continuing on its current street or turning onto the 
other street depends only on which street it approaches the intersection, not where the car was before 
approaching the intersection. Whether you are coming from the library or from the grocery store, if you are 
driving on North St., you are more likely to stay on North St. than to turn onto Elm. 

The outcome of a Markov process can be determined using a stochastic matrix. See Example 1 in the 
mathematics section to see how to determine the number of cars that will end up on North and Elm streets after 
the intersection. 

For some processes, the stochastic matrix can be applied to the outcome repeatedly to model the results over 
many cycles of the process. While the simple example above does not work this way since a vehicle won’t 
encounter the intersection again after passing it, we can devise a more general example that does. 
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Instead of a single intersection, consider a grid of North/South (N/S) and East/West (E/W) streets in which 15% 
of vehicles traveling on a N/S street turn onto the E/W street at any given intersection and 85% go straight to 
remain on the N/S street. Of the cars traveling on a E/W street, 5% will turn onto the N/S street at any given 
intersection and 95% go straight. Once a car goes through an intersection, it will eventually reach another 
intersection where it will turn or go straight based on the probabilities appropriate for the direction it is now 
going. See Example 2 in the mathematics section for an example. 

If a Markov process occurs at discrete time intervals or between discrete states rather than continually over 
time, it is a discrete time Markov chain (DTMC). Many Biological processes fit this model of changes over 
discrete time or between discrete states. 

At the population level, DTMCs can predict changes in population size and the time until the extinction of a 
population [5]. This model is based on the probabilities of a birth increasing the population by one, a death 
decreasing the population by one, or no change occurring during a given time interval. In order for the change 
in population to be limited to one, that is, not more than one birth or death occurring during the time interval, 
the time interval has to be sufficiently short [5]. This model is not as simple as those above because the 
probabilities of birth or death change as the number of individuals in the population changes, so the stochastic 
matrix changes with each iteration. Over many time intervals, the population will grow indefinitely, become 
extinct or reach a stable sustainable level. 

An example of using DTMCs at the molecular level is the modeling of changes in protein conformation [4]. 
Many protein molecules experience changes in their shape and this change in shape can affect how they 
function. Consider a protein that has three conformational states, Ci, C 2 , and C 3 . Each second, there is a certain 
probability that a molecule of the protein will switch to one of the other shapes. These probabilities can be used 
to create a transition matrix to predict the steady state amount of protein in each conformation [4]. This is of 
biological significance because the different conformations may have different strength, different binding 
affinity or different efficiency in catalyzing reactions. Changes in cellular conditions, such as pH, temperature, 
or presence of other compounds may affect the probabilities of switching between conformations, leading to a 
more or less effective population of protein molecules. 

I am interested in modeling cellular differentiation using DTMCs to predict the proportion of different cell types 
that would be present after a certain length of time. Cellular differentiation is the process of a stem cell 
becoming a specific mature cell type [ 6 ]. All the different types of cells in the human body are the descendants 
of a single zygote formed when a sperm fertili z ed an egg. The process begins with a mass of totipotent cells 
formed by repeated cell division of the zygote. Totipotent cells are not differentiated at all. They have the 
ability to give rise to any type of cell. In the formation of an embryo, the totipotent cells differentiate into one 
of three cell types: endoderm, mesoderm, and ectoderm. These cells are called pluripotent because they can 
give rise many different cell types, but not all. Endoderm cells eventually differentiate in to muscle, lung, and 
gut cells; mesoderm cells give rise to connective tissues like cartilage, bone, blood, and fat; and ectoderm cells 
become skin cells and neurons [ 6 , 7]. This process involves many steps though multipotent cells types that have 
been left out for simplicity. 
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The standard view of cellular differentiation is that it is a one-way process. Once a cell has differentiated, it may 
differentiate further along the same path into an even more specialized cell, but it cannot revert back to a less 
differentiated cell type and take a different path. See the figure below from [6]. 


Totipotent embryonic stem cell 



Pluripotent embryonic stem cells 


Endoderm line Mesoderm line Ectoderm line 


Human emt 
Irx 


. Q Q Q 

Multipotent stem cells 



0 


Heart Red Wood 
muscle cell 


0 



Skin Neuron 


However, there are situations in which reversion and re-differentiation may occur. It is necessary in animals 
that regrow amputated body parts, such as amphibians and lizards growing a new tail. There is some evidence 
that mammalian cells may also dedifferentiate under certain conditions [8]. An important area of current 
research is the use of induced pluripotent stem cells. These are differentiated adult cells that are artificially 
converted back into stem cells that can become many different cell types to create tissues and organs ex vivo 
[ 6 ]. 

While the standard process would still be interesting to model, I chose to create a DTMC model of cellular 
differentiation that included the possibility of some dedifferentiation, although this would be a rare event. A 
Spanish Wikipedia page provided the visual model of the process [9]: 


Multipotente 


> t Unipotente 
Espedficacao 



The first thing to consider in modeling the differentiation shown in the figure above is the transition 
probabilities. This model is extremely simplified in both the number of cell types and the number of steps. For 
example, a population of totipotent cells differentiates into more than two types of pluripotent cells, which 
54 The St. Clair County Community College Journal of Mathematics 













divide into more than two types of multipotent cells, which go through more than one step to become fully 
differentiated adult cells. Also, since the figure itself is only a representation of the general process instead of 
specific known pathways of differentiation, the transition probabilities are completely invented. Except for the 
totipotent cells, which all differentiate into pluripotent cells, there is at least a small probability that a cell type 
will remain the same rather than differentiating to the next level. The probability of dedifferentiating (moving 
back to a less differentiated cell type) is very small. See example 3 on the mathematics section for thel lxll 
transition matrix. 

The transition matrix has some interesting properties I noticed as I was setting it up. As with any transition 
matrix, the main diagonal indicates the probability of remaining in the same state. A quick glance at these 
values shows with cell types are transitory steps and which terminally differentiated cells that are unlikely to 
change. One cells type that does not follow this pattern is the cell type labelled C2. The represents a cell type 
like the spermatogonia in the testes. Every time a spermatogonium divides, it produces one spermatogonium 
and one spermatocyte. The spermatocyte will give rise to four sperm while the new spermatogonium with 
divide to produce a spermatocyte and another spermatogonium. This continual self-renewing of the 
spermatogonia allows men to produce sperm throughout their life time instead of eventually running out. 
Another interesting feature of the transition matrix is that values below the main diagonal are the probabilities 
of transition to a more differentiated state while values above the main diagonal are the probabilities of 
dedifferentiation. The later values are fewer and smaller than the former. 

In order to see how a population of cells will change over time, calculate the product of the transition matrix 
and a matrix representing the original cell population. To represent the original population of undifferentiated 
cells, we will assume there are 100 totipotent cells and no other cells types, a llxl matrix with 100 as the initial 
entry and zeros in the other positions. Multiplying this by the transition matrix yields a population with no 
totipotent cells and two types of pluripotent cells. Multiplying by the transition matrix again produces the next 
generation of cells. Repeating many times shows the eventual population of differentiated cells after n 
generations: P n = T n P 0 . See example 3 in the mathematics section. 

In this model, the discrete time interval for the Markov chain is one cell cycle. Since the population of cells 
will double with each time interval, the population matrix must be multiplied by 2 for each time interval or 2 n 
for each time interval. Since the population matrix changes each interval but the transition matrix remains the 
same, we can multiply the transition matrix by the scalar 2 to get P n = (2T) n P 0 , the population of cells, instead of 
just the proportion of cells, after n generations. 
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Mathematics 


Example 1. The intersection of North and Elm* 


A stochastic matrix represents probabilities staying on the same street or turning onto the other: 


Starting on Elm 

Starting on North 


0.45 

0.25 

Ending on Elm 

0.55 

0.75 

Ending on North 


The product of this matrix and a matrix representing the number of cars approaching the intersection on 
each street will predict the number of cars on each street after the intersection. If 40 cars approach the 
intersection on Elm, and 160 cars approach the intersection on North, 


|"o.45 

0.25 1 

[ 40 1 


r 58 1 

Lo.25 

0.75 J x 

L 160 J 

= 

L 148 J 


there will be 58 cars on Elm and 148 cars on North on the other side of the intersection. 


Example 2. The intersection of North/South and East/West streets 


Stochastic matrix representing the probability staying on the same street or turning onto the other: 


Starting on N/S 

Starting on E/W 


0.95 

0.15 

Ending on N/S 

0.05 

0.85 

Ending on E/W 


If there are 340 cars on N/S streets and 80 cars on E/W streets, after these cars reach an intersection, 


ro.95 

0.15 1 

f 340 1 


f 335 1 

1.0.05 

0.85 J x 

OO 

O 

= 

L 85 J 


there will be 335 cars on N/S streets and 85 cars on E/W streets. After these cars reach the next 
intersection on their respective routes, 


fo.85 

0.15 1 

I" 335 ' 


r 331 1 

Lo.05 

0.95 J x 

L 85 . 

= 

L 89 J 


there will be 331 cars on N/S streets and 89 cars on E/W streets. If this process continues through many 
intersections, the number of cars on the N/S and E/W streets reaches a steady state where the number of 
cars turning onto E/W streets is the same as the number of cars turning onto N/S streets to the outcome 
will no longer change. 
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Example 3. Cellular Differentiation 


Cell differentiation model: 



It, 


C1 

C2 

C3 

C4 

1 1 

It 

47 


-> D1 

D2 

■> D3 

D4 


Transition matrix [T]: 


From 

A 

B1 

B2 

Cl 

C2 

C3 

C4 

D1 

D2 

D3 

D4 

To 


0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

A 


0.6 

0.1 

0 

0 

0 

0 

0 

0.005 

0 

0 

0 

B1 


0.4 

0 

0.05 

0 

0 

0 

0 

0 

0 

0 

0 

B2 


0 

0.70 

0 

0.05 

0 

0 

0 

0.01 

0 

0 

0 

Cl 


0 

0.2 

0 

0 

0.5 

0 

0 

0 

0 

0 

0 

C2 


0 

0 

0.45 

0 

0 

0.05 

0 

0 

0 

0.005 

0 

C3 


0 

0 

0.5 

0 

0 

0 

0.2 

0 

0 

0 

0.001 

C4 


0 

0 

0 

0.95 

0 

0 

0 

0.95 

0 

0 

0 

D1 


0 

0 

0 

0 

0.5 

0 

0 

0.035 

1.0 

0 

0 

D2 


0 

0 

0 

0 

0 

0.95 

0 

0 

0 

0.985 

0 

D3 


-0 

0 

0 

0 

0 

0 

0.8 

0 

0 

0.01 

0.999- 

D4 


Original population [P 0 ] TP 0 = Pi 


TPi = TTP 0 = P 2 TP 2 = TTTP 0 = P 3 T 1() P 0 = Pi 0 


100 

0 

0 

0 

0 

0 

0 

0 

0 

0 

L 0 


0 

60 

40 

0 

0 

0 

0 

0 

0 

0 

L o 


0 

6 

2 

42 

12 

18 

20 

0 

0 

0 

L 0 


0 

0.6 

0.1 

6.3 

7.2 

1.8 

5 

39.9 

6 

17.1 


0 

0.207 

0.000 

0.549 

0.145 

0.094 

0.028 

35.734 

23.365 

17.591 

L 22.288- 1 


Population after n cycles = T n P 0 
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Consideration of population doubling each time interval: 


Pi = T(2P 0 ) 

P 2 = T(2P0 
= T(2(T(2P 0 ))) 
= 2 2 T 2 P 0 
= (2T) 2 P 0 
P n = (2T) n P 0 


*Something I learned while setting up the introductory examples: There are right and left stochastic matrices. 

In a right stochastic matrix, the rows add up to 1 and in a left stochastic matrix, the columns add up to 1 [10]. 
After considering this bit of information and how the product of the stochastic matrix and the initial state matrix 
yields the outcome state, I figure out the following: If your stochastic n x n matrix, A, is a left matrix, you 
multiply it on the left of the matrix of the initial state, B, a n x 1 matrix, to get the outcome state AB. If your 
stochastic n x n matrix, A, is a right matrix, you multiply it on the right of the matrix of the initial state, B, a 1 
x n matrix, to get the outcome state BA. I initially set up a right matrix and multiplied it on the left of the initial 
2x1 initial state matrix. When the sum of the entries of the outcome matrix wasn’t the same as the sum of the 
entries of the initial state matrix, I knew something was wrong. 
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Using Inverse Functions as an Aid to Compute a Derivative Value 


Nick Goins 


For the first edition of Continuum, I wrote an article illustrating how to use conjugates to compute the 
derivative of a radical function, for any index. In what follows, I will present an alternative, algebraic, 
procedure to compute these same derivatives. This technique will also apply to a wider class of functions, but 
will prove most useful in the case of radicals. As we will see, the method will require that the point of interest 
be on the x-axis. However, we will then generalize the method by showing that we can shift a function 
vertically so that the corresponding point is on the x-axis, before applying the method. 


Consider the limit of a difference quotient definition of the derivative, 


/'(a) - lim 
h-> o 


/(a + K) — /(a) 
h 


When working with this definition, one of the potential challenges is dealing with the /(a + h) term. In what 
follows, I will discuss a technique that I discovered in which we will use inverse functions as an aid to compute 
a derivative value, at a specific point. First, consider the following theorem. 


Theorem 1 : For a differentiable function /r(x), in which k( 0) = 0, and for a function /(x) which is 
differentiable at x = a, then 



Proof: Using L’Hopital’s Rule, 


lim 

h —>0 


f(a + k(h))-f(a ) 
k(h) 


= lim 
o 


f(a + k(h))k'(h) - 0 
k'(h ) 


lim f'(a + /r(/i)) 

/l—>o 


Then, interchanging the limit and the function /' and using the fact that k(K) is continuous and passes through 
the origin, 


= /' (lim a + k(h)^ = f(a + /r(0)) = /'(a) 

□ 

The next example will illustrate the theorem. 

Example 1: Let /(x) = x 2 + 4x and let /r(x) = sinx. Compute /'(2). 
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Solution: Using the formula in the above theorem, we get 


/'(2) = lim 

h—>0 


/(2 + fc(> 0 ) 


k(h) 


/ (2) /(2 + sin h) 

-= lim-— 

h—>o sin h 


/(2) (2 + sin h) 2 + 4(2 + sin h) 

-= lim--- 

h—>o sin h 


12 


4 + 4 sin /i + sin 2 h + 8 + 4 sin h — 12 8 sin h + sin 2 h 

= lim---= lim--- 

h—>o sin h h—>o sink 


lim 8 + sin h = 8 

h->0 


□ 

Regarding the formula in the theorem, if a + k(li) happened to be the inverse of /(h), then the first term in the 
difference quotient would reduce to h and so we would get 


/'(a) = lim 
h->o 


f(a + k(h)) 


k(h) 


/(a) nr\h))~ 

- lim-—— 

h-> o k(Jx) 


/(a) 


lim 

/l—>o 


h - /(a) 
k(K) 


In order for this limit to exist, we would need /(a) to be 0. That is, x — a must be an x-intercept of the function 
f{x). 

Example 2: Compute /'(0), where /(x) = sin -1 x. 


Solution: Using the formula in the theorem, the value of the derivative is 

/(0 + k(h)) — f (0) f(k(h)) sin _1 (/c(h)) 

/ (0) = lim-r——- - lim - - lim--- 

/i^o /c(h) h—>o k(h) h—>o k(h) 

If we choose k(h) — sin h, then k(h) is the inverse function of /, is differentiable at 0 and k(O) — 0. Then, 


sin 1 (sinh) h 

= lim-—- = lim —— - 1 


h—>o sink 


iwosinh 


□ 

In the remaining examples, we will set up the limit of a difference quotient and then find the necessary 
expression for k(h). 

Example 3: Compute /'(0), where /(x) = \fx. 


Solution: 


/'(0) = lim 

/l->0 


/(0 + k(h)) — f (0) 
k(h) 


fikih)) MW) 

lim - lim , - - 

h ->o k(h) h—>o k(h) 


With the goal of having the numerator reduce to h , we will choose k(h) — h 3 . Also, for this choice of k(h), we 
see that it has the property of being differentiable and passes through the origin. Then, 
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□ 


3 Jk(h ) VF h 1 

lim , „ N = lim —- lim — = lim — = oo 

/i—>o /c(n) /i—>o h 5 n^o h 6 fnor 


A Slight Generalization of the Above Technique 

If /(x) is differentiable and invertible but x = a is not an x-intercept of /(x) then define g(x) — /(x) — / (a). 
Then, g' (a) = /'(a) and x = a is an x-intercept of g (x). In other words, if we have every requirement for the 
above technique except for the fact that x = a is not an x-intercept of /(x), then we can shift the function 
vertically so that we do get an x-intercept since vertical shifts do not alter the derivative values at the 
corresponding points. This follows directly from the fact that the derivative of a constant is 0, so that if g (x) = 
fix) + c, then g'(a) — /'(a). 

Example 4: Compute /'(1), where /(x) = \[x. 


Solution: Since x = 1 is not an x-intercept of /(x), we will shift the graph down one unit and find the 
derivative at x = 1 of the shifted function. That is, let g(x) — /(x) — 1, so that $(1) = 0 and also $'(1) = 


/'(I) = g'(l) = lim 

h->0 


g(l + k(h)) 


k(h) 


g( i) .. g(X + k(h)) 

- lim-——- 

h-*o k(h) 


We want to choose k(h) so that 1 + k(h) is the inverse of g. Equivalently, we want the numerator, g(l + 
k(h)), to reduce to h. That is, we want \jl + k(h) — 1 — h. Solving this equation for k(h), gives k(h) = 
(h + l) 3 — 1. Notice that k(h) is differentiable at h = 0, and /c(0) = 0. Then, 

^ g(l + k(h)) i h v h 

h™ kfh) = I*™ (h + iy - 1 = I*™ h 3 + 3h 2 + 3h + 1 - 1 


— lim 


h 


h -+o h(h 2 + 3/i + 3) 


lim-———-- 

h—>o h 2 + 3h +3 


1 

3 


\ 

That is, / (1) = -, as expected. 

□ 

In example 4 if the index was larger than 3, say n, then expanding the term (h + l) n would’ve been more 
work. However, notice that when we evaluated the limit we really only needed the second to last term (why?). 
The following will prove to be very useful in the case of radicals with a larger index. 

Note: In the remaining examples, we are going to use the binomial theorem, in the following way, 


(h + a) n — hp(h) 
+ a n 


62 


The St. Clair County Community College Journal of Mathematics 




That is, expanding (h + a) n yields a sum of terms in which all but the last have a factor of h. We can factor out 
the h from the first terms, leaving a polynomial p(/t), and the last term is a n . The details for this are as follows, 

n n n 


(h + a) n = Q h k a n ~ k = ^ Q h k a n ~ k + a n = Q h k ~ x a n ~ k + a n 

k =0 k =1 k =1 

Then, define p(/t) = Zfc=iQ)^ /c_1 . In particular, we can write p(/t) as follows, 

n n n 

p(7i) = ^ Q h k ~ 1 a n ~ k = Q h 1_1 a n_1 + ^ Q h k ~ 1 CL n - k = na^ ^ Q 


fc=i 


k =2 


k =2 


so that 


p(0) = na n 1 


The function value p(0) is the coefficient of the second to last term in the polynomial p(/t). In example 4, we 
could’ve written (/i + l) 3 = hp(h ) + l 3 = hp(h ) + 1, and then p(0) = 3(1) 2 = 3. This will become more 
useful in the following examples, as we consider expansions with powers larger than 3. 

Example 5: Compute /'(3), where f(x) — \fx. 

Solution: Since x = 3 does not correspond to an x-intercept of /(x), we define g (x) = /(x) — /(3). That is, 
g (x) = Vx — V3. Then, 


/'(3) = p'(3) = lim 

h—>0 


g(3 + /c(/t)) -p(3) 
k(h) 


g(_3 + k(h)~) l/3 + k(h) - V3 

lim-—-- lim- —— - 

/i—>o /c(h) h-> 0 /c(/l) 


The goal in finding k(h) is that we want the numerator to reduce to h. Again, this is equivalent to 3 + k(Ji) = 
p _1 (/t), so we solve \/3 + k(Ji) — \[3 = h, for k(Ji). Doing so, gives 

k(K) = (h + V3) 7 - 3 


Substituting this into the above limit gives 


lim 

h-> o 


V3 + k(h) - V3 
k(hj 


— lim 

h —>0 


13+ (h+V3) -3-V3 
(h + V3) 7 - 3 


= lim 

h—>0 


h 


( h + y 3) -3 


To evaluate this limit using algebra, we could expand the seventh power, but that is much more work than we 
need to do. We will use the observation, (h + a) n = hp(K) + a n . That is, 


63 


Volume 2, Number 1, Continuum 




lim-=-- lim-=- 

h->0 (h + Vs) 7 - 3 h ^° hp(h) + (V3) - 3 

Reducing this expression, 

h /ill 

= lim -———-—- = lim -— 7 — = lim —— - —— 
ft-onp(n) + 3 — 3 ft-o/ip(/i) ft-op(/i) p(0) 

In the discussion above, we found the function value p( 0) = na n ~ 1 , where n is the index and a is the value at 
which we are finding the derivative, so that we get the following final result 


f (S') 


1 

7(VT) 


6 


□ 

Note: In example 3, /(/c(/i)) = h, so that k(h) = / -1 (/i). In example 4, x = 1 was not an x-intercept of /(x), 
so we shifted the function vertically downwards one unit. Then in that example, we found g( 1 + k(h)) — h, so 
that 1 + k(h) — g _1 (h) and so k(h) — p -1 (/i) — 1. In general, the relationship between the original function 
/(x), the value of a and the function /c(/i), is expressed as follows 


k(h ) — f 1 (h + /(a)) — a 


In example 5, we found k(h) — (h + VS) — 3, which has the same form as the boxed formula stated above. If 
x — a corresponds to an x-intercept of /(x) at the origin (so that a = 0 and /(a) = 0), then k(K) is exactly the 
inverse of /. If x = a is an x-intercept of /(x) but not at the origin, then k(h) is a vertical shift of the inverse of 
/. If a = 0, but / (a) =£ 0, then k(K) is a horizontal shift of/ -1 . Of course, a need not be zero and need not 
correspond to an x-intercept. The same expression works for k(K) in each scenario. In what follows, we won’t 
use the above expression for k(K), but will find it as we progress through the examples. The overall goal is to 
illustrate the algebra that is needed to evaluate these derivative values, not to find simplifying formulas. 

Example 6: Compute /'(ll), where /(x) = 1 Vx. 

Solution: Since x = 11 does not correspond to an x-intercept of /(x), we define g(x ) = /(x) — /(ll). That 
is, g(x) — 1 Vx — 1 VTl. Then, 


a(U+ *(A))-9(11) g(u + k(h)) ‘V 11 + m - ‘VlT 

/ (11) = q (11) — lim-r-TTT-= lim-——-= lim-- 

1 K J v K J h ^ 0 k(h ) h—*o k(h ) ft -0 k(h ) 


19 . 19 

Similar to the previous example, k(h) — (h + VTl) — 11, so that 
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19 


19 

"Jll + k(h) - ‘VTT J11 + (A + ! VTT) -11 - 'VTl 

lim- tttt -= lim--= lim ■ 


h 


/l—>o 


k(K) 


ft—>0 


{h + 1 Vn) 19 -11 


ft—>0 


0+ 19 VII) 19 -li 


= lim 


h 


— lim- 


h 


0 h P (h ) + CVTI) 19 - li fcpW + 11-H 


h 1 

= lim -——■ = lim 


7i->o hp(li) h—>o p (/i) p(0) ig^VII) 18 

The method presented above works for functions other than radicals, as illustrated by the next example. 

Example 7: Compute /'(5), where /(x) = lnx. 

Solution: First, define g(x) — /(x) — /(5). 

g(S + k(h))-g(S) gCS + km ln(5 + k(h)) — In 5 

q (5) = lim-——-- lim-——-= lim-——- 

h-> o k(Ji) h->o k(h) h—>o k(h ) 

Solving the equation ln(5 + /c(/t)) — In 5 = h, gives 

k(h) =e h+ln5 -S 

then substitution back into the limit, 


□ 


ln(5 + k(h )) — In 5 ln(5 + e^ 1115 — 5^ — In 5 
lim —--= lim -+--— 4- -= lim 


h 


ft+ln5 


ft—>o k(h ) h-*o e 

The expression inside the limit can be manipulated as follows 


h ^ oe h+\n5 _ 5 


v h r h v h 1 v 1 1 1 

“ ft™ e h e ln5 - 5 ~~ ft™ 5e h - 5 ft™ 5(e h - 1) 5 ft™ (e h - 1\ ~ 5 7 (e h - 1\ 

HH It-) 


i 

5 


□ 

The preceding examples can be summarized by the following theorem, which is only stated for completeness, 
and will not be used in any of the remaining examples. 

Theorem 2: For a function /(x) which is differentiable at x — a and is invertible, 


65 


Volume 2, Number 1, Continuum 



CM = LIS 7=T 


h 


h^o f~ 1 (h + /(a)) 


a 


Proof: The limit has indeterminate form so we can apply L’Hopital’s Rule, 


lim 


h 


lim 




lim- 


/-(ft+ /(«»-a -o^ [/ - 1(ft + /(a)) _ a] -o U- 1 )'(h + /(a)) 


Then, using the formula for the derivative of an inverse, 


1 

— lim - t 

/l—>0 p 

/'(f-Hh+m)) 

Since / is differentiable, then f is continuous and since / is invertible, we know that / _1 is continuous as well. 
Thus, we can interchange the operation of taking a limit with these two functions, 


1 


1 


1 



1 


■ 1 ■ 

1 


l/'CTWa)))] 


l/'(a)J 


F (f “ fa h +1(“)) 


□ 

In the remaining examples, we will consider functions built by composition. That is, the “inside” function is not 
just x. 


Example 8: Compute /'(3), where /(x) = ^2x + 1. 


Solution: Since x = 3 is not an x-intercept of /(x), we define g (x) = /(x) — /(3). Then, the derivative of 
/(x) evaluated at x = 3 is 


/'(3) = 5'(3) = lim 

/l—>o 


5(3 + fc(fc)) -5(3) 
k(h) 


g (3 + k(h )) V 2 (3 +W) + l- V 7 

lim -T773-- lim-T773- 

/l —>0 fc(/l) /l —>0 /c(/l) 


To find the expression for fc(7i), set the numerator equal to h and solve for k(Ji). 


V2(3 + fc(h)) + l- V7 = h 
= l(h+V7) 5 


7 

2 


2k{K) + 7 = (h+ VJf 


k(h ) 
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Substituting this expression into the limit, 


lim 

h —>0 




+ 7 - Ml 


\{h + Wf~l 


= lim 

h —>0 


(h + V 7) - 7 + 7 - V7 
\ [0 + V7) 5 - 7 


2 lim 

/l—>o 


S J(h + V7) 5 - V7 
(h + V7) 5 - 7 


2 lim 

h —>0 


h 


hp(h ) + (V7) 


2 lim 


h^op(h) p( 0) 


5(V7) 


□ 


Note: Consider the expression, ((/i + a) m + b) n , for positive integers m and n. Using a previously discussed 
formula, we can write this as follows 

((h + a) m + b) n = (7ip(/i) + a m + b) n 

Using the binomial theorem, this can then be written as follows 


n 

(Jip(Ji) + a m + b) n = ( hp(h ) + (a m + b)f = ^ Q (hp(h)) i (a m + b) n ~ l 

i =0 

Then pull off the i = 0 term and factor an h, out of the sum, 

r n 

h l p l (h)(a m + b) n ~ l 

Lt = l 



+ (a m + b) n = h 


u 

h i ~ 1 p i {h)(ja m + b) n ~ i 


+ (a m + b) n 


Label the summation in brackets in the latter expression as q(K). The following will show how to compute the 
value of q{0 ). 

n 

q (h) = ^ Q /i*-y (TlXa™ + h) n " i 

i=l 


n 

= Q h 1 ~ 1 p 1 (Jx){a m + h)”- 1 + ^ Q h i ~ 1 p i (h)(a m + h) n " i 

i =2 


= np(h)(a m + b) n 1 


/t 

+ j h l ~ 2 p l 

i =2 


p l (h)(a m + b) n 1 
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Thus, 


g(0) = np(0)(a m + b) n 1 


where 


((/i + a) m + b) n = hq(h) + (a m + b) n 


Example 9: Compute /'(1), where fix') — VV* + 4. 
Solution: Define g(x) = fix) — /(1). Then, 


rm = ,'co = iim g(1 + fc ^~ g(1) = 

h—>o kih) h—>o kih) h-> o kih) 


Using the same idea as the previous examples, we solve I \/l + kih) + 4 — V5 = h, for /c(/i). Then, 


fc(Ji) = ((h +V5) 2 -4) 3 - 1 


Which gives the following 


lim 

/l—>o 



1 + /c(7i) + 4 — 


Vs 


fc(Ji) 


h 

- lim- 

h + Vs) 2 



- 1 


Notice that the denominator has the same form as the expressions discussed in the note preceding this example. 
Taking advantage of that 


lim 

h 


h 


h 


i-5-= lim-5- 

°((ft + V5) 2 -4) -1 *~°h q (h) + ((VS) 2 + (-4)) -1 

Finally, using the previously discovered value of q(0), we get the final result 

1 1 

/'CD - 


= lim 


h 


h^ohqih.) q(0) 


3 • 2(V5) 2-1 ((V5) 2 + (-4)) 


6\/5 


□ 
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Example 10: Compute /'(4), where fix') — Vx 2 + 1. 
Solution: Define g(x) = /(x) — /(4). Then, 


/'(4)=5'(4) = lim 

/l—>o 


p(4 + /e(/i))-,g(4) p(4 + /e(/i)) V(4 + /c(/i)) 2 + 1 - Vl7 


fc(Ji) 


= lim 

h —>0 


k(h) 


— lim- 

?i-> o 


fc(Ji) 


As the reader can verify, we will choose k(h) = (h + VT 7) — 1 — 4, so that the above limit becomes 


= lim- 

/l ->0 


h 


l(h+ VTT) -1-4 


To evaluate this limit, we will use a conjugate, 


lim 

h^O 


h 


(h + VTT) 3 -1 + 4 h VTT) -1 + 4^ 


(h + VTT) — 1 — 4 J(h + VT7)- -1 + 4 


= lim 

h —>0 


(h + VTT) - l - 16 


Using the binomial theorem again, 


= lim 

h —>0 


h (J( h + VTT) -l + 4j h (h + VTT) - l + 4j + VTT) 3 - l + 4 


hp(h) + [VTT) - 17 


lim 

h—>0 


hp (h) 


lim- 

h —>0 


p(h) 


Pi 0 ) 


In the above example, p( 0) = na n 1 = ?>{VTT) , so that f{VTT) — 


i(Vl7) 


□ 


In example 10, we used the conjugate for a square root function because of the squared term in the radicand. If 
that exponent was a larger integer, say n, then we would end up having to use a conjugate for a radical with 
index n. This method is presented in the first edition of Continuum and the final example will illustrate using 
the method of this article along with the method from that aforementioned article. 


Example 11 : Compute /'(2), where fix) — Vx 4 — 5. 
Solution: Define gix) — fix) — /(2). Then, 


r (2) =g'( 2) = lim gC+fcWO-flg) _ lim g( 2 + *W) 
7 w B y J h^o kih) h^o kih) 
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lim 

h—>0 


V (2 + k(h)Y - 5 - VTT 

TO) 


Solving for /c(/t), gives /c(/t) = (h + VTT) +5 — 2. Then, substituting this back into the above limit gives 


2 + (h + VTl) +5-2 -5- VTT 


- lim- 

/ l ->0 


I (h + VTT) +5-2 


- lim — 

/l~+0 4 


h 


(h + VTT) +5-2 


To evaluate this limit, we need to use a conjugate for a radical of index 4 (see the first edition of Continuum). 
To find the appropriate conjugate, consider the following factorization 

a 4 — b 4 — (a — b)(a + b)(a 2 + b 2 ) 

So, if the denominator in the above limit is "a — b" and if we want to take that fourth root to the fourth power, 
we will multiply by (a + b)(a 2 + b 2 ). That is, 


h 

— lim - - 

h ^° 4 U 7 , - n 7 

Jo + VTT) +5-2 


ft 

V + VTT ) 7 + 5 + 2 )( 


V+VTT ) 7 + 5 ) + 2 2 ) 

ft 

V + VTT ) 7 + 5 + 2 ) | 

:g 

V+VIT ) 7 + 5 ) + 2 2 ) 


h 0 * + VIT ) 7 + 5 + 2) Ojo + VIT ) 7 + 5) + 2 2 

(O+vnfTT) -2* 

A (0 + VIT ) 7 + 5 + 2) ff 4 J(h + VIT ) 7 + 5) + 2 2 

- lim-0- 

h ^° 0 + VTT ) +5-16 
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h 


lim 

/ l ->0 


lhp(h) + (VlT ) 7 + 5 + 2 j ^ 4 Jhp(h ) + (Vll ) 7 + 
hp(h) + (Vll) — 11 



(yhpQi) + 16 + 2) U\jhp(K) + 16) 2 + 4 ) 

lim-—- 

/i—>o p(n) 

_ (vis+2) ((Vie) 2 +4) 

P(0) 

4-8 

" 7(Vn) 6 

32 

” 7(VTT) 6 


71 


Volume 2, Number 1, Continuum 



Johann Bernoulli 


Bom in Basel, Swiss Confederacy, on July 27 th , 1667, seventeen years before Leibniz would publish his 
earth-shattering paper on calculus, Johann was the younger brother and lifelong rival of his equally famed 
mathematician brother, Jakob. (Jakob introduced the term integral in 1690; Johann claimed it was his coinage.) 
His father wanted him to be a merchant, but he was excited by mathematics. His first work in mathematics was 
Solutio problematis funicularii, which appeared in Acta eruditorum (the same journal Leibniz’s paper appeared 
in) in 1690. 

Johann made fun of his brother frequently. When Jakob though he had correctly described the shape of 
a sail, pointing out that in different circumstances it might take different shapes, Johann seems to think he is 
claiming that the curve which describes the shape of one sail is made up of parts of other curves [Peiffer]. 

When Jakob shared with Johann the differential equation that yields the answer, Johann solved it and said: 
“Once more he forces me to complete the solution that he has begun and developed until this equation, after 
which he apparently gave up” ( Journal des savants, 1692, p. 189). 

It seems likely that Johann did not want to be outshone by his brother, or always considered as 
secondary to the great Jakob. In a letter to Leibniz, Jakob claims to have been the first to understand Leibniz’s 
calculus. When Jakob found a formula for curvature, Johann printed it on cards and passed them out as his 
business card! Jakob then translated his formula into polar coordinates and announced it as “ inconnu meme de 
monfrere ” - “unknown even by my brother.” [Peiffer] 

Johann wrote to L’Hopital on January 12, 1695, speaking of his brother: “He is a misanthropist in 
general and does not even spare his own brother.. .He is filled with rage, hate, envy and jealousy against me.” In 
a later letter: “You would not believe how much this brother, unworthy of the name, hates me, persecutes me 
and tries to destroy me.” [Peiffer] 

In Paris through most of 1691, Johann taught calculus to the Marquis de L’Hopital, who wrote the first 
book on the subject in 1696, Analyse des infiniments petits. 

Johann became a Doctor of Medicine in 1694 when he applied differential calculus to muscular 
contractions. 

Johann proposed a problem, called the brachistochrone problem, in Acta eruditorum in June 1696: what 
is the curve of quickest descent of a body moving from a higher point to a lower point in a plane; or, from 
Wolfram Mathworld, “Find the shape of the curve down which a bead sliding from rest and accelerated by 
gravity will slip (without friction) from one point to another in the least time. The term derives from the Greek 
[IpaxiGTOG (brachistos ) "the shortest" and xpovoa ( chronos ) “time, delay.” ” 

Five solutions were published in the magazine almost a whole year later, in May, 1697. One of them 
appeared anonymously. 

At this time, Newton was retired, and he was working in the Royal Mint. He got home one day and saw 
the problem in a letter from Johann. It had been a long day. He was tired. But he sat down and solved the 
problem. Right then and there, in one night. He sent his answer to the magazine anonymously. It had taken 
Bernoulli two weeks to solve the problem 

When Bernoulli read the anonymous solution, he knew right away that Isaac Newton had come up with 
it. How did he know? Well, what he said was, 

“I know the lion by his paw.” 

Johann arrived at a solution by thinking about a beam of light. He knew, according to Fermat’s 
Principle, that whatever path between two points requires the least time, is the path that a beam of light will 
always follow. He imagined how a beam of light would move if affected by gravity. 

Leibniz took one week to solve the problem. 

In 1700, Johann’s son Daniel was born. Daniel would also become a mathematician, and a friend of the 
great Euler. According to E.T. Bell [ Men of Mathematics] Daniel was “kicked out of the house” for winning a 
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prize which his father had also competed for. Johann also stole one of his son’s papers, changed the name and 
date, and submitted it as his own work. Nikolaus and Johann II were the other sons of Johann Bernoulli, and 
they too were fine mathematicians. 

One of the most important activities of Johann was his fierce partisanship of Leibniz in the calculus 
priority dispute. “Johann saw himself as Horatio, bravely defending Leibnizian calculus from the arrogant, 
misguided English.” [Tent] Johann fostered the idea that Newton had plagiarized Leibniz, and in a letter of 1713 
he attacked Newton’s character - he later denied having written the letter. Johann encouraged Leibniz to 
publish “challenge problems” which he felt the Newtonian mathematicians would be unable to solve with what 
he believed were their inferior methods. “Newton would, as you know, find himself in difficulties.” (Hall, p. 
216). The French Academy of Sciences took Leibniz’s side, printing an attack by Bernoulli on Newton’s 
Principia without allowing a response. (Hall, p. 214). After Leibniz’s death, Bernoulli did try to mend fences 
with Newton, writing “I desire nothing so much as to live in good fellowship with him, and to find an 
opportunity of showing him how much I value his rare merits, indeed I never speak of him save with much 
praise.” (Hall, p. 238). 

One of Johann’s greatest contributions was what he referred to as “exponential calculus” (now 
considered just a part of calculus) which contained the rule d(lnit) = du/u. However, Johann also believed 
that logs of negative numbers existed and were real numbers. “He drew this remarkable, counterintuitive 
conclusion by applying what we would call the chain rule in taking the derivative of ln(—x).” [Bedard] 

Another tremendously important contribution of Johann Bernoulli to mathematics is an indirect one: he 
was the tutor of the great Leonhard Euler. Euler honored Bernoulli, calling him “illustrious.” 


Bedard, Paul: “Euler and the Bernoullis: Learning by Teaching” MAA Convergence August 2015. 

Bell, E.T: Men of Mathematics , Simon & Schuster, 1937 

Hall, Alfred Rupert: Philosophers at War: The Quarrel Between Newton and Leibniz , Cambridge University 
Press, 1980. 

Peiffer,Jean: “Jacob Bernoulli, Teacher and rival of his brother Johann,” Journal Electronique d’Histoire de 
Probabilite et de la Statistique , Vol 2 No. 1 November 2006) 

Tent, M.B.W. : Leonhard Euler and the Bernoullis: Mathematicians from Basel , Taylor and Lrancis Group, 
2009. 
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Questions 


1. List 5 noted mathematicians from the Bernoulli family. 


2. What seems to have been the attitude of Bernoulli towards the intellectual property of others? 


3. What is the name (no longer used) of the branch of calculus which Johann Bernoulli was a pioneer in? 


4. What error did Johann make about logarithms? 


5. Applying the chain rule and ignoring the domain issues, find the (nonexistent) derivative of y = 
ln(—x), x > 0 


6 . How did Johann sole the brachistochrone problem? 


7. Johann said of Newton, “I never speak of him save with much praise.” What is one quote which belies 
this claim? 


8 . How did Johann hope to humiliate British mathematicians? 
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9. Other than Euler, name a famous student of Johann Bernoulli. 


10. Who introduced the term integral ? 


11. What did Johann print on his business cards? 


12. How long did it take Leibniz to solve the brachistochrone problem? 


13. How long did it take Newton to solve the brachistochrone problem? 


14. What earned Johann Bernoulli a doctorate in medicine? 
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Reparametrization by Arc Length 


Nick Goins 


For a vector function f (£), the input variable £ is referred to as a parameter. The word parameter is also used when 
referring to the arc length parameter, the context should make it clear which of the two uses is intended. In what follows 
we will discuss the topic of changing the input parameter for a vector function and what impact this has on the graph of 
the vector function, the description of the points on the graph and of the speed function. 

Recall that the arc length, |v(£) | dt is the length of the curve representing f (£) from the point corresponding to £ = a to 
the point corresponding to £ = b, more precisely, the length of the curve from the terminal point of the vector f (a) to the 
terminal point of the vector r(b). The notation, P(a), is frequently used for the terminal point of r(a). We can then 
create a function which measures how far we’ve travelled from the terminal point of f (a) after a time £, by replacing the 
upper limit of integration with the variable £. We call this function the arc length parameter, s(£) = J^ |F(r)| dr and we 
refer to a as the basepoint. Again, more precisely, the basepoint is the terminal point of the vector f (a). 

1. Let f (£) = (3£, £ + 4). Find the arc length parameter with basepoint corresponding to £ = 0. Then compute s(0), s(l) 
and s(VT0). 

2. Let r(£) = (£,£ + 2). Find the arc length parameter with basepoint corresponding to £ = 2. Then compute s(0), s(l) 
and s(V2). 


Note: For a vector function r which is not constant, its arc length parameter satisfies the following 

s(£) = I dT =* — = > 0 



dr(z) 

ds 

■-r —\ _ _ 

dr{t ) 

L 

dz 

Cl l —r - 

dt 

dt 


That is, the function s(£) is increasing. One implication of this is that s(£) has an inverse function. The following 
describes the inputs and outputs for s(£) and s _1 (£) 



That is, the function s(£) gives the arc length on the curve described by r(£) from t = a (the basepoint) to the point 
corresponding to £ = c. The inverse function takes an arc length l and gives the value of £ in which the arc length between 
£ = a and £ = c is l. In particular, if we find the arc length parameter with basepoint P(a), then s(a) = 0 and so 
s -1 (0) = a. 
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3. Let r(t) = (3t, 21 — 1). Find the arc length parameter with basepoint corresponding to t = 1. Then find the inverse of 
this function. That is. find s _1 (t). 

4. Let r(t) = (3t, t + 4). Using your work from exercise #1, find the inverse of the arc length parameter. That is, find 

-> 12 — 2 —\ 

5. Let r(t) = (-(3t — 1 ) 2 , -(f — 1 ) 2 ). Find the arc length parameter with basepoint corresponding to f = 0. Then find 
the inverse of this function. That is, find s _1 (t). 


Example: Consider the vector functions r(t) = (2t , t + 1) and R(t) = (6t, 3t + 1), which represent lines in the xy-plane. 
For t = 1, the point on the graph for r(t) is (2, 2) and the point on the graph for R(t) is (6,4). Similarly, the point on the 
graph for r(t) when f = 0 is (0,1) and the point on the graph for R(t) when t = 0 is also (0,1). Thus, the graphs pass 
through the same point. To find the slope of the line represented by r(t), choose two points on the line, such as (2 a, a + 

1) and (2 b, b + 1). The slope through these points is 

Ay (b + 1) — (a + 1) b — a 1 
Ax 2b — 2a 2 (b — a) 2 

Similarly, to find the slope of the line represented by R(t ), choose two points on the line, such as (6 a, 3 a + 1) and 
(6b, 3b + 1). The slope through these points is 

Ay _ (3b + 1) - (3a + 1) _ 3(b - a) _ 1 
Ax 6b — 6 a 6 (b — a) 2 

Therefore, the lines are the same. Notice the relationship, R(t ) = r(3t). 

Definition: For a vector function f(t), a reparametrization is another vector function R(t) which has the same graph as 
r(f), but the points are labelled differently. 

In the above example, the point (4,3) is on the graph of f(t), and corresponds to f = 2. The point is also on the graph of 
-» 2 

R(t) and corresponds to f = -. So, the points are the same, they are just labelled differently depending on which vector 
function we are referring to. 

Constructing a Reparametrization: For a vector function, r(t), a reparametrization will have the form r(/(f)). 

Note that there are some restrictions on the function /(f). In particular, the domain of / must be M and we need the range 
of / to match the domain of r. Thus, the domain and range of / must be M. In addition, we want / to be monotonic. For 
example, if r(t) = (t, t 3 ) and /(f) = t 2 , then r(/(f)) = (t 2 , t 6 ). The graph of r(/(f)) does not match the graph of r(t) 
(why?). 

Example: From the previous example, we can see that R(t ) = (6t, 3t + 1) is a reparametrization of r(t) = (2t, t + 1) 
because R(t) = r(f(t )) where /(f) = 3t. 
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Example: Let r(t) = (t 2 , e t ),/(t) = At and g(t ) = —2 1 + 5. Then the following are both reparametrizations of r(t), 


r(/(t)) = ((4f) 2 ,e 4t ) = (16f 2 ,e 4t ) 
r(#(t)) = ((—2t + 5) 2 ,e _2t+s ) = (4t 2 - 20t + 25, e~ 2t+s ) 


Thus, a vector function can have infinitely many different reparametrizations. 

6. Let r(t) = (t^t 2 ^. 

a ) Sketch the graph of f(t) by first filling in the following table. Label the point on your graph corresponding to f = 0 
and f = 1. 


input 

output 

point on graph 

0 



1 



-1 



2 



-2 




b ) Let /(f) = 3t. Sketch the graph of r(/(f)) by first filling in the following table. Label the point on your graph 
corresponding to f = 0 and f = 1. 


input 

output 

point on graph 

0 



1 



-1 



2 



-2 
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7. Let r(t) = (t 2 + t,t + 2). 

a ) Sketch the graph of r(t) by first filling in the following table. Label the point on your graph corresponding to f = 0 
and t = 1. 


input 

output 

point on graph 

0 



1 



-1 



2 



-2 



-0.5 




b ) Let /(f) = t + 1. Sketch the graph of r(/(t)) by first filling in the following table. Label the point on your graph 
corresponding to f = 0 and t = 1. 


input 

output 

point on graph 

0 



1 



-1 



2 



-2 



-1.5 




In the above exercises, notice that reparametrizing did not change the graph, it only changed the description of each point, 
as you illustrated by labelling the points corresponding to f = 0 and f = 1. 

Reparametrization by Arc Length 

For a vector function r(t) and for a fixed basepoint, P(a) (that is, the point on the graph corresponding to f = a), we are 
going to label a point by its distance along the curve from P(a). 
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H "* 'n* 



The above provides a visual for the relationship s(a ) = 0 and so s 1 (0) = a, that is the same point is either labelled as 0 
or as a, depending on the vector function description of the curve. 

Definition: For a vector function r(t), the speed function is s'(t) = \v\. 

Note: The unit tangent vector is T = so that v = \v\T. This can then be written as follows 

dr 

Tt = s 'mm 

and shows that we can factor the derivative of a vector function into two components. That is, the derivative is the product 
of the speed and tangent vector which shows that the derivative has a dynamical component and a geometric component. 

Procedure to find the Reparametrization by Arc Length: 

For the vector function r(t) with basepoint corresponding to t = a. 

1. Find the arc length parameter s(£). That is, s(t) = J^\v(t)\ dr. 

2. Find the inverse function, s -1 (t). 

3. Simplify the composition, R(t ) = r(s _1 (t)). 


8. Let r(t) = (3t, t + 4). Reparametrize r(t) by arc length, using f = 0 as the basepoint for s. Use your work from 
exercise #4. 

9. Let r(t) = (2t + 1, 3t — 1). Reparametrize r(t) by arc length, using t = —1 as the basepoint for 5. 
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10 . Let f(t) = (2 1 , t + 1). 

a) Find the point on the graph of r(f) corresponding to t = 1. 

b ) Find the arc length parameter s(t), with basepoint corresponding to f = 0. 

c) Find s _ 1 (t). 

d) Compute s _ 1 (0). 

e) Compute the arc length of r(f) from t = 0 to t = 1. 

/) Reparametrize r(f) by arc length. That is, find R{t) = r(s _ 1 (t)). 

g) Evaluate R(t) using the value of the arc length from part c as the input. That is, the t value is the arc length of r(f) 
from f = 0 to f = 1 . 

h ) Find the point on the graph of R (f) corresponding to f = V5. 

V) Find the speed function for r(f) and for R(t). 


Note: One result of reparametrizing a vector function by arc length, gives a speed function which is identically equal to 1. 
The graph of the reparametrization is identical to the original graph, the only change is the speed on the curve. Thus, in 
this case the derivative of the reparametrized vector function is identical to the unit tangent vector. 


11 . Let r(f) = (t — 2, 

4 

a) Reparametrize r(t) by arc length, using t = — -as the basepoint for s. 

b) Compute s - 1 (0). 

c ) Show that the speed function for the reparametrization is 1. 

12. Let r(t) = (- 7 -^ — 1, -j— 0 . Prove that the curve is the unit circle centered on the origin, by reparametrizing r(t) by 
arc length, using f = 0 as the basepoint for s. 
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Higher Order Derivatives of Functions with Multiple Outputs 


Rebekah Muzzi 


Derivatives of real-valued multivariable functions can be found using matrices. But when a function has 
multiple variables and multiple outputs, its derivatives are too complex to be expressed by a single matrix. We 
can, however, express its derivatives by creating matrices with matrix entries. 


The first derivative of a function of the form /: R n -» M is the gradient vector 


Df — V/ — [f Xl fx 2 fx n \- 

The second derivative is given by the Hessian matrix 

fxl fx 1*2 ■■■ fx lXn 
fx 2X1 f x 2 fx lX 2 

.fx n x-i f Xn - 



For / 




where f k = f k (x v 


Jacobian matrix, is given by 


x n ), the first derivative, also called the total derivative or the 



\dA 

df i 

d/il 

[VAi 

v/ 2 


dx ± 

df 2 

dx 2 

df 2 

dx n 

df 2 

= 

dx ± 

dx 2 

dx n 

yfm. 


dfm 


... 

02 


.dx 1 


dx n J 


We can extend this pattern to find the second derivative of functions of the form /: M n —> ¥l m by listing the 
Hessian of each component function. 



/*i\ 

( f f\ 

\ H fl] 

For / 

1 x 2 \ _ 

. where f k = f k (x lt ..., x n ), the second derivative is given by D 2 f — 

Hf 2 


\^nj 

\fml 

Hfm- 


/X\ / xy 3 + z 5 \ 

Example: Find Df and D 2 f for / ( y } = xye z 

\3 x 3 +y 2 z 4 / 


82 


The St. Clair County Community College Journal of Mathematics 



Solution: First, find the gradient vectors of ff 2 , and f 3 and combine them to make Df. 


Vfi — [y 3 3 xy 2 5 z 4 ] V/ 2 = [ ye z xe z xye z ] V/ 3 = [9x 2 2yz 4 4y 2 z 3 ] 



[y 3 

3xy 2 

5z 4 ] 

Df = 

[ye z 

xe z 

xye z ] 


[9x 2 

2yz 4 

4y 2 z 3 ] 


Then find the Hessian matrices of f\,f 2 , and f 3 . 


Hfi = 

' 0 3y 2 O' 

3y 2 6xy 0 

Hf 2 = 

' 0 e z ye z ' 
e z 0 xe z 

Hf 3 = 

18x 0 0 

0 2z 4 8yz 3 


. 0 0 5z 4 . 


ye z xe z xye z . 


0 8yz 3 12y 2 z 2 


Finally, combine the Hessians into one matrix, D 2 f. 



' 0 

3 y 2 

0 ' 


3y 2 

6 xy 

0 


. 0 

0 

5z 4 . 


0 

e z 

ye z ' 


e z 

0 

xe z 


ye z 

xe z 

xye z . 

18x 

0 

0 

0 

2z 4 

8yz 3 

0 

8yz 3 

12y 2 z 


We can verify that the second derivative works by using it to find approximations. First, we need a formula for 
approximating function values with second derivatives. The second difference quotient for a function of the 
form /: R R is the difference quotient of the difference quotient. 


D 2 (f, X) h) 


D(f,x + h]h ) — D (/,x; h ) 
h 


/(x + h + h) — f{x + h) f(x + h) — fix ) 
h ~ h 

h 


fix + 2 h) — 2 fix + h) + fix ) 
h 2 


When h is small, the second derivative is given by the limit 


/"(*) 


lim 

h-> oo 


fix + 2h) — 2 fix + h) + fix ) 
h 2 


and fix ) can be approximated by 


fix) 


fix + 2h) — 2 fix + h) + fix) 
h 2 
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Solving for f(x + h), 


/(x + Ji)« -^h 2 f"(x) + ^f(x + 2h) + i/(x) 


Since we are working with multivariable functions, we will use the equivalent formula for vectors where the dot 
product is used for multiplication: 

f(p + h) * + \f{p + 2h) + \m 


/1.1\ /X\ j xy 3 + z= 

Example: Find an approximation for / [ 1.1 where / I y I = I xye z 

Vl.l/ \z' \3x 3 +y 2 z 4 


n\ - ( ai 

Solution: Identify p = ^lj , h = ^0.1 


From the previous example, 


D 2 f = 


' 0 

3 y 2 

3y 2 

6 xy 

. 0 

0 

0 

e z 

e z 

0 

ye z 

xe z 


D 2 f\ P = 


0 8 yz 3 12y 2 z i 


0 3 O' 

3 6 0 

.0 0 5. 

0 e e 

e 0 e 

.e e e. 

18 0 0 ' 

0 2 8 

. 0 8 12 . 


Substituting into the formula, 


/ 1.1 
/ 1-1 
Vi.i 


0 3 O' 

3 6 0 

.0 0 5. 

0 e e 

e 0 e 

.e e e. 

18 0 0 ' 

0 2 8 

. 0 8 12 . 


1 

+ 2 f 


1 

+ 2 f 
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1.8 0.572 0.272 

0.572 0.8 1.072 

0.272 1.072 1.972. 


'0.1\ x /4.562X 
0.1 1 + - 4.781 + 

" \ n * *-i 


.0.1/ 


.8.170> 



1 /0.2644X / 

- 0.2444 + 

L VO.3316/ V 


2.281\ 
2.391 + 
4.085/ 

/l.r 

Check the approximation by directly evaluating / 1.1 

Vl.l, 







Before we can find third and higher order derivatives of functions with multiple outputs, we need higher order 
derivatives of multivariable functions with single outputs. Let's observe the pattern so far: 


First Derivative: 


Df = Vf = \f Xl f 


X 2 


fx n\ 


Second Derivative: 


We can also write this as 


Hf = 


fx\ fx 1*2 

fx 2*1 / x\ 


fx x x n 
fx 1*2 



Yfx r 

,%i 


fx 

2 

n J 


1 

CD 

JR 

fx 2 

fx J 


r 9 

dx 1 Df 

Hf = 

dx 2 [fx ' 

fx 2 

fxn\ 

— 

d 

d x 2 °f 


L„ ifx ■ 

fx 2 

f%n\ 


d 


So the next derivative can be found by taking the partial derivatives of the previous derivative with respect to 
each variable. 


The third derivative of a function with a single output is 
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r d 


D 3 f = 


dx 1 

d 


dx 2 

d 


IdXr 


Hf 

Hf 

Hf 


f x\ fx 1 x 1 x 2 
fx 1 x 2 x 1 f x ± x% 


Ux 1 x n x 1 

f: 

f 


x 2 xl 


fx 


*2*1*2 


/* 2 * 2*1 




X 2 x n x 1 


f: 

fx n X 2 X 1 


x n xl 


fx 


fx-nX^X-2 
f X n x I 


fx\X x X n 
fx\X 2 X n 

f X-^Xn 
fx 2 x±x n 

-/* 2 * 1*2 


f. 


* 2 *ri 


fx 


X n X n Xl 


fx n x 1% 
fx n X 1X2 

fxl 


and the fourth derivative is 


D 4 / 


r d 

r d 

d 

d 

dx 1 

^-Hf 

ldx 1 

^~ H f - 

dx 2 

• ^—Hf 

dx n . 

d 

r d 

d 

d 

dx 2 

^-Hf 

ldx 1 

^ — H f - 

dx 2 

dx n . 

d 

r d 

d ' 

d 

-dx n 


a * 2 Hf ■ 

■ Sx„ Hf . 


3 2 (H/) 


d\Hf) 

dx\ 

dx x dx 2 

d X-^ d Xyi 

3 \m 

d\Hf) 

d 2 (Hf) 

dx 2 dx 1 

3 *| 

dx2 d Xyi 

3 W) 

d 2 {Hf) 

d 2 (Hf ) 

.d Xu d x-^ 

d Xyi d x^ 

dx2 


Since even derivatives will be square matrices like the Hessian, we will notate them in terms of H. If k is even, 


D k f — H2f 


Example: D 4 f = H 2 f 

So the general formula for the /cth derivative is 


D k f 


k -1 -t 


-—H 2 / 

dx ± 


k-l 


-—H 2 / 

dx 2 


g k-l 

L 


if k is odd and 


D k f = m f = 


d 2 k ~ 2 

H ~f 

a2 k—2 

H~f 


dx 1 dx 2 


k—2 

H~f 


k—2 


q2 k — 2 


dx 2 dx 1 


dx 


2 H~f 


dx 1 dx n 

g2 k—2 


H 2 f 

k—2 

H~f 


Ldx n dx 1 


k—2 

H~f 


dx n dx 2 


k—2 

H~f 


dx 2 dx 

d 2 k ~ 2 

hn—f 


if k is even. 
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Now we can find higher order derivatives of functions with multiple outputs. The kth derivative of a function of 
the form /: R n M m is the column vector where the j th entry is the k\h derivative of the j th component 
function. 


Exam P le: Find D 3 f off Q = ( x 4 X + y y4 ) 
Solution: First, find the partial derivatives of /. 


—^— = 15x 2 y 3 
ox 


dfz 

dx 


= 4x 3 


—^— - 15 x 3 y 2 
oy 


dfz 

dx 


= 4y" 


Then find the Hessian of each component function. 


' 30xy 3 45x 2 y 2 ' 

_45x 2 y 2 30x 3 y. 


Hf 2 = 


12x 2 

0 


0 

12y 2 


Then take the partial derivatives with respect to x and y of each Hessian matrix to find D 3 f. 


T 30y 3 90xy 2 l |"90xy 2 90x 2 yl" 

.90xy 2 60x 2 y. .90x 2 y 30x 3 . 


'24x 

01 

0 

0 ' 

- 0 

oJ 

0 

24y 


We can verify the third derivative with the following approximation formula found by taking the difference 
quotient of the second difference quotient: 


f(p + h) - lh 3 f"\p) - ^/(p + 3h) + f(p + 2h) + ^/(p) 


(1 1\ /X 

Example: Find an approximation for / where / 
Solution: Identify p = ^, h — (^''^) 



From the previous example, 



T 30y 3 

90xy 2 l 

'90xy 2 

90x 2 yl" 


r 

30 

901 

r90 

9011 

D 3 f = 

[90xy 2 

[24x 

60x 2 yJ 

01 

_90x 2 y 

0 

30x 3 J 

0 1 

D 3 f\ P = 


90 

'24 

60J 

°1 

1-90 

f° 

O 00 

_ 1 o_ 


l 0 

oJ 

.0 

24yJ 


- 

. 0 

oJ 

1.0 

24-1 J 


Substitute into the approximation formula 
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f(P + h) 


12 

18 

2.4 

0 


30 

901 

r90 

90 

90 

60J 

1-90 

30- 

'24 

°1 

r° 

°1 

- 0 

0-1 

lo 

24-1 


C)OC)-5«^ + «^ + > 


18 
9 . 

0 

2.4JJ 


)f u ) 

1 

r 2 e 4 - 134 ) + 

/14.93A 

+ 1( 5 ) 

>\ o.i/ 

_ 3 

l 5.712 ) 

<4.147/ 

3 <2/ 


i 

3 . 


1 rl.44 
3 1.8 


1.8 

f 0 ' 1 ) 

/8.045a 

/14.93A 

/1.667a 

1.14. 

VO.l/ 

<1.904/ + 

<4.147/ + 

<0.667/ 


^ /0.108A 
V0.098/ 

/l.V 

Check by evaluating / 1.1 

Vl.l/ 


) - ( a045 ) + 

/ 14 .93A 

/1.667a _ 

/ 8.66 A 

' V 1.904/ 

<4.147/ + 

<0.667/ 

<3.008/ 



8.858\ 

2.928/ 


' 8.66 A 
<3.008/ 


The other derivatives can be verified using similar formulas. 


Functions with multiple outputs are impossible for our human minds to visualize, but by using derivatives with 
matrices embedded within matrices, we can discover useful information about functions in higher dimensions. 
We explored what the derivatives look like and how to use them to find approximations. These derivatives 
could also be useful for finding other information about functions such as their maximums, minimums, or 
curvature. In spite of the limitations of the human imagination, math makes exploring higher dimensions 
possible. 
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Exoplanet 51 Pegasi-b, Determination of Orbital Parameter in an 

Astronomy Laboratory 


James Mattiello, Ph.D. 


As an Adjunct Instructor at SC4,1 have the opportunity to teach the introductory Physics and Astronomy 
classes. To stimulate the critical thinking skill of my astronomy students, I like to ask a few basic questions to 
test the astronomical knowledge. First question, “What is the name of the nearest star?”. Most of the time the 
reply is Proxima or Alpha Centauri, which is wrong, the nearest star is our own Sun, Sol. I also like to ask, 
“How many planets do we know about?”. This stimulate a debate, is Pluto included in the count, should the 
other newly discovered Kuiper Belt Objects (dwarf-planets) be included ( Haumea, Makemake, and Erls). The 
answer usually varies between 8 to 14 planets. The students are surprise when I tell them that there are off by a 
few thousand planets. I didn't restricted the planet count to just our solar system. The actual number is in around 
3322, as of 2016. These extra solar system planets, exoplanets, 9 from our own solar system, including Pluto, 
and some 3300 exoplanets around other stars. All these exoplanets have been confirmed via astronomical 
observation and are located within some 30,000 light years distance from Earth. You can check the most current 
number by visiting the exoplanets official websites, http://exoplanets.org or Hanno Rein’s exoplanet app, 
http: // exoplanetapp .com . 

In my astronomy class we leam about the four basic methods used to detection exoplanets; radial velocity 
measurements, transit light curves, direct imaging, and gravitational micro-lensing, (Perryman, 2011). The 
motion of a planet in orbit around a host star causes this star to undergo a reflex motion about the star-planet 
center of mass. Additional planets also produce additional reflex motion about the center of mass. This motion 
results in the periodic perturbation of observable properties, radial velocity, angular position in the sky, and in 
the arrival time of periodic reference signal from the host star. 


Most of the exoplanets discovered to date have been detected using the radial velocity method. This 
method uses precise spectroscopic measurements of the periodic Doppler shift of a reference spectral lines of 
the host star. The line of sight, radial part, of this Doppler shift motion occurs because the host star is moving 
around the center of mass. An away from the Earth motion, produces a Doppler redshift (shift toward longer 
wavelengths) of the spectral line. A toward the Earth motion, produces a Doppler blueshift (shift to shorter 
wavelengths) of the spectral line. The Doppler shift equation used by astronomers to calculate the radial 
velocity is; 


V=S 



3 

c 

) 


eq. 1 


where (X re f) is the reference wavelength, (/.„/„) is the observed wavelength from the host star, and c is the speed 

J 8 

of light (c = 3.0x10 m/s). Note that a Doppler blueshift produces a negative radial velocity, while a Doppler 
redshift produces a positive radial velocity. 


In the astronomy laboratory, the students use published data from radial velocity measurement of star 51 
Pegasi, and its exoplanet 51 Pegasi-b (Marcy, GW, Butler RP, 1997). This star is in the constellation Pegasus 
and is similar in mass, temperature and radius to our own Sun. The date and time for these observation are in a 
Julian date format. The Julian date is used by astronomers as a way of expressing the date and time as a single 
number, counting the days (and fraction of days) since January 1 st , 4713 BC. These observation started on the 
Julian date 2,450,000 which was from October 11, 1995 and through August 9, 1996. 
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The students laboratory assignment is to fit a non-linear function to the radial velocity data. From which 
they can determine the orbital period, the planet’s time to complete one orbit around the host star, and the other 
parameters for this exoplanet, its mass and semi-major axis (distance from the host star). From a preliminary 
examination of the 51 Pegasi's data, it is apparent that it is sinusoidal in nature, with an orbital period of 
approximately 4 days and a radial velocity of about ± 55 m/s. If you want to fit this data using an Microsoft 
excel or Apple's numbers spreadsheet program, a problem arises that these spreadsheets don’t fit a sinusoidal 

function to the data. To solve this issues, the computing software Mathematica’ is used to process the data, the 
Mathematica code is shown in Appendix C at the end of this article. 

The students begin by entering the Julian date (time) data, as a row array, this array is stored as a variable. 
The radial velocity data are entered as another row array, and are stored as a second variable. The dimension for 
the two arrays are checked to making sure that both list have the same number of data points. The two arrays are 
combined and put into a ( x , y) column matrix. Where x is the date (time) and y is the radial velocity. A non¬ 
linear sinusoidal function of the following form is used to fit the data; 

>' = AsLrc((^)jr-C)-i-£) eq. 2 

Where the amplitude A is the radial velocity (in m/s), B is the orbital period (in days), C is the phase and D the 
y-intercept. The first attempt to fit this function to the data results in a poor outcome, see Figure 1. 



Remember that this data is sinusoidal in nature, with a period of approximately 4 days and a radial 
velocity amplitude of about ±55 m/s. For a second attempt, the non-linear fitting algorithm is given some 
assistance, using a guess of A = 55 m/s and B = 4 days as starting values. This second attempt results in a better 
non-linear fit to the data as shown in Figure 2. 


This non-linear fit to the sinusoidal function (equation 2) results in a set of the parameters, the orbital 
period (B = 4.230 days), and a radial velocity of (A = ± 56.978 m/s) which the students are prepared to use to 
determine the other exoplanet properties. The orbital period for exoplanet 51 Pegasi-b, 4.230 days is converted 
into years, 0.01158 years. Using Kepler’s 3 rd law, the orbital period (in years) squared is equal to the semi¬ 
major axis (in AU ) cubed, one can determine the semi-major axis. Remember the Earth is 1 Astronomical 
Units, 1 AU, from our Sun with an orbital period of 1 year. Using the converted orbital period value the semi¬ 
major axis is calculated to be 0.0512 AU. The mass of the exoplanet is calculated using a modified Kepler’s 3 ld 
laws equation, which uses the Doppler shift radial velocity of the host star and the exoplanet’s orbital period, 
the equation is derived in Appendix A and B. From this equation, the mass of 51 Pegasi-b is determine to be 
0.45 Mj. It is calculated in Jupiter masses to compare this exoplanet’s properties to values observed for our own 
solar system. 

The students have derived the three basic parameters of this exoplanet 51 Pegasi-b its, mass (0.45 Mj), 
orbital period (4.230 days), and semi-major axis (0.0512 AU). In keeping with Cindy Wade’s philosophy 
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published in the first volume of this journal, “And Check Your Work”. The students are reminded to compare 
their calculated values with published values. The host star properties are determined from stellar evolution 
models and other astronomical observations and are summarized in Table 1. Also shown in Table 1 are 
published properties of the exoplanet 51 Pegasi-b. From the values listed in Table 1, the experimental error for 
the calculated orbital period and semi-major axis is less than 1%, and the experimental error for the mass was 
approximately 2%. The students are very pleased with their results. 

Remember that 51 Pegasi is similar is size to our own Sun. So one might expect a similar solar system, 
and by u sing our solar system and its giant planets as a reference model. The students are asked to reflex on a 
few questions to complete this astronomy laboratory; 

1. Considering the mass of this planet as well as its distance from the host star, why was the discovery of 
exoplanet 51 Pegasi-b such a surprise? 

2. Comment on the environment of this planet? 

3. Is it possible for this exoplanet to be hospitable to life? 

As a reference, Mercury has an orbital period of 0.241 years (88 days ) and is 0.387 AU from the Sun, 
while Jupiter has an orbital period of 11.863 years (about 4333 days ) and is 5.203 AU. It should be noted that 
this radial velocity method has a bias in the detection of massive planets with small orbits, they are commonly 
referred to as “Hot Jupiters”. Compared to our solar system, where our giant planets are at large distances (> 5 
AU) from our Sun. The location of these Hot Jupiters gives rise to an astronomical mystery; 

• “Why do these Hot Jupiters exist in their small orbits?” 

• “Do these exoplanets migration towards their host star or did they form at these small orbits?” 

Recently the international Astronomical Union has allowed the public the opportunity to name some of 
these star systems and their exoplanets. (Montmerle T, Benventuti P, 2015) The host star 51 Pegasi has been 
named, Helvetios, and its exoplanet 51 Pegasi-b was named, Dimidium. (Beatty JK, 2016). 

References 

Rein H, “Exoplanet app”, http://exoplanetapp.com , 2016. 

Perryman M, The Exoplanet Handbook, Cambridge University Press, 2011. 

Marcy, GW, Butler RP, et.al., “The Planet Around 51 Pegasi", The Astrophysical Journal, 481: 926 - 935, 1997 
June 1. 

Wade C, “And Check your Work”, Continuum, vol. 1, num. 1: 41 - 42, 2016. 

Montmerle T, Benventuti P, et. al., “Public Naming of Planets and Planetary Satellites”, Transactions IAU, Vol. 
XXIXA, Proc. XXVIII IAU General Assembly, August 2012.. 

Beatty JK, Sky & Telescope, April 2016, p 14. 


91 


Volume 2, Number 1, Continuum 



Table 1 

reference http://exoplanets.org/detail75 l_Peg_b 

51 Pegasi properties 

51 Pegasi-b properties 

Mass of Star 

1.054 + 0.03 Me 

Mass of Exoplanet 

0.461 + 0.016 Mj 

Radius of Star 

1.025 + 0.036 Re 

Semi-major Axis 

0.0521 + 0.0008 AU 

[Fe/H] 

0.200 + 0.03 

Orbital Period 

4.2311 + 0.0005 days 

Stellar Spectral Type 

G2 IV 



Temperature 

5787 + 44 K 



Radial Velocity 

55.9 + 0.8 m/s 



Distance to Star 

15.61+0.01 pc 
~ 50.9 light years 



Me and Re represents a solar mass and solar radius. 

Mj represent a Jupiter mass. 


Appendix A. 


If we assume that the exoplanet system is observed in the plane of the exoplanet’s orbit, we can then use 

jV/ V = M V 

the following center of mass equation to calculate the exoplanet’s mass, ,ur '‘- ml . Where the 

parameter are the host star’s mass and Doppler shift radial velocities and the exoplanet’s mass and orbital speed. 
If the system is not view in the plane of the exoplanet but at some inclined angle, it is not possible to measure 
the non-radial part of the radial velocity. It is only possible to obtain a lower limit on the mass of the exoplanet 


and so its mass might actually be larger, ^ 


^ M, 


sin(r)= M 


pivnr! 


where i is the angle of inclination. 


To derive a form of Kepler’s 3 rd law for determining the mass of a planet. We start by deriving an 
equation of the star’s mass in terms of the orbital speed and semi-major axis of the exoplanet; 


tr — f' 

I iTiMr. 




R 


M V 

■ ‘ r V l Lr t r r ■ jn. .vi' .rv. J .:' 

R 


= F 


: L r iiiu r .TV.ir.Ti 


M,, r = 


RV, 


■ 1 r i*l 1 "i! I Y-1 1 1 , -> 


where R is the 


semi-major axis (in m), 

Vorbital speed is the exoplanet’s orbital speed (in m/s) and G is the universal gravitational constant ( 6.67 x 10 11 
Nrn/kg). Using this value of the star’s mass (M star ), and the center of mass equation, we derive the equation for 
calculating the mass of the exoplanet, knowing the Doppler shift radial velocity of the host star and the orbital 
period of the exoplanet. 
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Kepler's 3rd Law, R ' = T 2 


eq. 5 


The Doppler shift radial velocity for our Sun due to the effect of Jupiter is {Vsun = 12.45 m/s), and is 
calculated in Appendix B. We want to compare the exoplanet mass to the mass of Jupiter, so our final equation 
becomes; 


2 JT , 
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r*-«i r ‘ m - _ I, ,r 


2 'V- v 

- - Jf^pfllT T .‘ljllii-Y v r Vi , i* ^l!l 
UP 


= 


r. 

*.|K|T 


V/ 


ila r aL'.rpi'irr ahii 






QjQl 158 jr 
I 1 L 863 >t J 




56.9a ^ 
12.45 


M = 045 


eq. 6 


Appendix B. 

To calculating the Doppler shift radial velocity of our Sun due to Jupiter, we use the center of mass equation; 


ju v = m v 

11 fw ' .Vim *” Jupitr 
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Sophie Germain 


Marie-Sophie Germain was born on April 1 st in Paris in 1776. (Louis XVI and his wife Marie 
Antoinette had assumed power ten months earlier.) She was the middle of three sisters. She was fascinated by 
lectures delivered by Lagrange at the Ecole Polytechnique, and wrote brilliant letters to him signed “M. 
(monsieur) Le Blanc.” Women were not allowed to attend the Ecole at this time, prompting biographer Dora 
Musielak to ask, “How did Germain leam mathematics on her own before sending her analysis to Lagrange?” 
(Musielak, Prime Mystery: The Life and Mathematics of Sophie German , © 2015, p. viii.) 

That was a time when France was perhaps the most powerful country in Europe, and its 
cultural influence was such that nobles, monarchs, and the educated people in many other 
countries often spoke elegant French instead of their native languages. France was also 
among the most scientifically enlightened nations in the world. The Royal Academy of 
Sciences in Paris had among its members some of the most eminent and influential 
mathematicians in history such as Lagrange, Laplace, and Legendre (Luh-ZHON-dra.) 

(Musielak, p. 1) 

Sophie Germain lived 500 meters from La Conciergerie, the prison where Marie Antoinette and many 
others awaited their turn with the guillotine.(Note: the Paris Academy of Sciences devised the metric system 
around this time!) Her father was a politician, possibly a revolutionary, and her childhood would have been 
filled with overheard politics relating the French Revolution. She hid in her father’s library and read 
Archimedes. 

Lagrange’s printed lecture notes became available in 1798 (Musielak p. 33). He wrote about calculus 
and his idea of how to “unite calculus and algebra” and free differential calculus from “the difficulties that 
occur... .that discourage most of those who undertake its study.” (Musielak p. 28.) Around this time Germain 
wrote letters to Lagrange commenting on these lectures. 

In an 18 th century example of “mansplaining,” Professor J.A.J Cousin sent Gennain an algebra text he 
had written and offered to help her with her studies - Sophie at 21 was already far beyond this text. Another 
Parisian scientist apparently insulted her; he would later write 

II etait difficile, Mademoiselle, de me faire sentir plus que vous ne l’avez fait hier, 

1’indiscretion de ma visite et l’improbation de mes hommages, mais il m’etait difficile de 
le prevoir. (Musielak, p. 34.) 

(It would be difficult. Mademoiselle, for anyone to make me feel as you did yesterday, for the 
indiscretion of my visit and the disapproval of my respects to you, but it was difficult for me to predict 
it.) 

The great mathematician Gauss wrote this in a letter to Germain: 

But when a person of the sex, which according to our customs and prejudices, must 
encounter infinitely more difficulties than men to familiarize herself with these thorny 
researches, succeeds nevertheless in surmounting these obstacles and penetrating the most obscure parts 
of them, then without doubt she must have the noblest courage, quite extraordinary talents and a superior 
genius. (Lynn Osen, Women in Mathematics , quoting from E.T. Bell, quoting Gauss.) 
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Sophie Germain produced results in number theory, particularly regarding Fermat’s Last Theorem, and 
Sophie Germain primes are named for her. 

Definition: A prime number p is a Sophie Germain prime if 2p + 1 is also prime. 

The largest Sophie Germain prime known has 388,342 digits. From Wolfram Mathworld: 

“Around 1825, Sophie Germain proved that the first case of Fermat's last theorem is true for such primes, i.e., if 
Pis a Sophie Germain prime, then there do not exist integers x,y , and z different from 0 and none a multiple of 
p such that 

x v + y v — z v 

Sophie Germain’s Identity states that x 4 + 4y 4 = ((x + y) 2 + y 2 )((x — y) 2 + y 2 ) 

Germain died of breast cancer in 1831. 

Questions: 

1. Where did young Sophie hide during the French Revolution? 

2. What great French mathematician received letters from Germain on his lectures? 

3. Did that mathematician consider differential calculus difficult for most students? 

4. What scientific organization whose members included Laplace and Lagrange existed in 
Paris at this time? 

5. How did Professor Cousin condescend to Germain? 

6. How does the Declaration of Independence help us remember something about Sophie 
Germain? 

7. Did Gauss admire her? 

8. Is 11 a Sophie Germain prime? 13? 17? Verify your answers. 

9. Verify Sophie Germain’s Identity directly using algebra. 
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